HUMAN CALCIUM CHANNEL COMPOSITIONS AND METHODS USING THEM

TECHNICAL FIELD 

The present invention relates to molecular biology and 
pharmacology. More particularly, the invention relates to 
calcium channel compositions and methods of making and using 
the same. 

BACKGROUND OF THE INVENTION 

Calcium channels are membrane-spanning, multi-subunit
proteins that allow controlled entry of Ca2+ ions into cells
from the extracellular fluid. Cells throughout the animal 
kingdom, and at least some bacterial, fungal and plant cells, 
possess one or more types of calcium channel. 

The most common type of calcium channel is voltage 
dependent. "Opening" of a voltage-dependent channel to allow
an influx of Ca2+ ions into the cells requires a depolarization
to a certain level of the potential difference between the
inside of the cell bearing the channel and the extracellular
medium bathing the cell. The rate of influx of Ca2+ into the
cell depends on this potential difference. All "excitable" 
cells in animals, such as neurons of the central nervous 
system (CNS), peripheral nerve cells and muscle cells,
including those of skeletal muscles, cardiac muscles, and 
venous and arterial smooth muscles, have voltage-dependent 
calcium channels. 

Multiple types of calcium channels have been identified 
in mammalian cells from various tissues, including skeletal 
muscle, cardiac muscle, lung, smooth muscle and brain [see,
e.g., Bean, B. P. (1989) Ann. Rev. Physiol. 51:367-384 and Hess,
P. (1990) Ann. Rev. Neurosci. 55:337]. The different types of
calcium channels have been broadly categorized into four
classes, L-, T-, N-, and P-type, distinguished by current
kinetics, holding potential sensitivity and sensitivity to 
calcium channel agonists and antagonists. 

Calcium channels are multisubunit proteins that contain
two large subunits, designated α1 and α2, which have molecular
weights between about 130 and about 200 kilodaltons ("kD"),
and one to three different smaller subunits of less than about
60 kD in molecular weight. At least one of the larger
subunits and possibly some of the smaller subunits are
glycosylated. Some of the subunits are capable of being
phosphorylated. The α1 subunit has a molecular weight of
about 150 to about 170 kD when analyzed by sodium
dodecylsulfate (SDS)-polyacrylamide gel electrophoresis (PAGE)
after isolation from mammalian muscle tissue and has specific
binding sites for various 1,4-dihydropyridines (DHPs) and
phenylalkylamines. Under non-reducing conditions (in the
presence of N-ethylmaleimide), the α2 subunit migrates in
SDS-PAGE as a band corresponding to a molecular weight of
about 160-190 kD. Upon reduction, a large fragment and
smaller fragments are released. The β subunit of the rabbit
skeletal muscle calcium channel is a phosphorylated protein
that has a molecular weight of 52-65 kD as determined by
SDS-PAGE analysis. This subunit is insensitive to reducing
conditions. The γ subunit of the calcium channel, which is
not observed in all purified preparations, appears to be a 
glycoprotein with an apparent molecular weight of 30-33 kD, as 
determined by SDS-PAGE analysis. 

In order to study calcium channel structure and function, 
large amounts of pure channel protein are needed. Because of 
the complex nature of these multisubunit proteins, the varying 
concentrations of calcium channels in tissue sources of the 
protein, the presence of mixed populations of calcium channels 
in tissues, difficulties in obtaining tissues of interest, and 
the modifications of the native protein that can occur during 
the isolation procedure, it is extremely difficult to obtain 
large amounts of highly purified, completely intact calcium 
channel protein. 

Characterization of a particular type of calcium channel 
by analysis of whole cells is severely restricted by the 
presence of mixed populations of different types of calcium 
channels in the majority of cells. Single-channel recording 
methods that are used to examine individual calcium channels 
do not reveal any information regarding the molecular
structure or biochemical composition of the channel.
Furthermore, in performing this type of analysis, the channel
is isolated from other cellular constituents that might be
important for natural functions and pharmacological
interactions.

Characterization of the gene or genes encoding calcium 
channels provides another means of characterization of 
different types of calcium channels. The amino acid sequence 
determined from a complete nucleotide sequence of the coding 
region of a gene encoding a calcium channel protein represents 
the primary structure of the protein. Furthermore, secondary 
structure of the calcium channel protein and the relationship 
of the protein to the membrane may be predicted based on 
analysis of the primary structure. For instance, hydropathy
plots of the α1 subunit protein of the rabbit skeletal muscle
calcium channel indicate that it contains four internal
repeats, each containing six putative transmembrane regions
[Tanabe, T. et al. (1987) Nature 328:313].

Because calcium channels are present in various tissues 
and have a central role in regulating intracellular calcium 
ion concentrations, they are implicated in a number of vital 
processes in animals, including neurotransmitter release, 
muscle contraction, pacemaker activity, and secretion of 
hormones and other substances. These processes appear to be 
involved in numerous human disorders, such as CNS and 
cardiovascular diseases. Calcium channels, thus, are also 
implicated in numerous disorders. A number of compounds 
useful for treating various cardiovascular diseases in 
animals, including humans, are thought to exert their 
beneficial effects by modulating functions of voltage-dependent
calcium channels present in cardiac and/or vascular
smooth muscle. Many of these compounds bind to calcium
channels and block, or reduce the rate of, influx of Ca2+ into
the cells in response to depolarization of the cell membrane.

The results of studies of recombinant expression of
rabbit calcium channel α1 subunit-encoding cDNA clones and
transcripts of the cDNA clones indicate that the α1 subunit
forms the pore through which calcium enters cells. The
relevance of the barium currents generated in these
recombinant cells to the actual current generated by calcium
channels containing as one component the respective α1
subunits in vivo is unclear. In order to completely and 
accurately characterize and evaluate different calcium channel 
types, however, it is essential to examine the functional 
properties of recombinant channels containing all of the 
subunits as found in vivo. 

In order to conduct this examination and to fully 
understand calcium channel structure and function, it is 
critical to identify and characterize as many calcium channel 
subunits as possible. Also, in order to prepare recombinant
cells for use in identifying compounds that interact with 
calcium channels, it is necessary to be able to produce cells 
that express uniform populations of calcium channels 
containing defined subunits. 

An understanding of the pharmacology of compounds that 
interact with calcium channels in other organ systems, such as 
the CNS, may aid in the rational design of compounds that 
specifically interact with subtypes of human calcium channels 
to have desired therapeutic effects, such as in the treatment 
of neurodegenerative and cardiovascular disorders. Such 
understanding and the ability to rationally design 
therapeutically effective compounds, however, have been 
hampered by an inability to independently determine the types 
of human calcium channels and the molecular nature of 
individual subtypes, particularly in the CNS, and by the 
unavailability of pure preparations of specific channel 
subtypes to use for evaluation of the specificity of calcium 
channel-affecting compounds. Thus, identification of DNA
encoding human calcium channel subunits and the use of such
DNA for expression of calcium channel subunits and functional
calcium channels would aid in screening and designing
therapeutically effective compounds. 

Therefore, it is an object herein to provide DNA
encoding specific calcium channel subunits and to provide
eukaryotic cells bearing recombinant tissue-specific or
subtype-specific calcium channels. It is also an object to
provide assays for identification of potentially therapeutic
compounds that act as calcium channel antagonists and
agonists.

SUMMARY OF THE INVENTION

Isolated and purified nucleic acid fragments that encode 
human calcium channel subunits are provided. DNA encoding α1
subunits of a human calcium channel, and RNA encoding such
subunits, made upon transcription of such DNA, are provided.
In particular, DNA fragments encoding α1 subunits of
voltage-dependent human calcium channels (VDCCs) type A, type B
(also referred to as VDCC IV), type C (also referred to as
VDCC II), type D (also referred to as VDCC III) and type E are
provided.

DNA encoding α1A, α1B, α1C, α1D and α1E subunits is provided.
DNA encoding an α1D subunit that includes the amino acids
substantially as set forth as residues 10-2161 of SEQ ID No.
1 is provided. DNA encoding an α1D subunit that includes
substantially the amino acids set forth as amino acids 1-34 in
SEQ ID No. 2 in place of amino acids 373-406 of SEQ ID No. 1
is also provided. DNA encoding an α1C subunit that includes
the amino acids substantially as set forth in SEQ ID No. 3 or
SEQ ID No. 6 and DNA encoding an α1B subunit that includes an
amino acid sequence substantially as set forth in SEQ ID No.
7 or in SEQ ID No. 8 are also provided.

DNA encoding α1A subunits is also provided. Such DNA
includes DNA encoding an α1A subunit that has substantially the
same sequence of amino acids as encoded by the DNA set forth
in SEQ ID No. 22 or No. 23 or other splice variants of α1A that
include all or part of the sequence set forth in SEQ ID No. 22
or 23. The sequence set forth in SEQ ID No. 22 is a splice
variant designated α1A-1; and the sequence set forth in SEQ ID
No. 23 is a splice variant designated α1A-2. DNA encoding α1A
subunits also includes DNA encoding subunits that can be
isolated using all or a portion of the DNA having SEQ ID No.
21, 22 or 23 or DNA obtained from the phage lysate of an E.
coli host containing DNA encoding an α1A subunit that has been
deposited in the American Type Culture Collection, 12301
Parklawn Drive, Rockville, Maryland 20852 U.S.A. under
Accession No. 75293 in accord with the Budapest Treaty. The
DNA in such phage includes a DNA fragment having the sequence
set forth in SEQ ID No. 21. This fragment selectively
hybridizes under conditions of high stringency to DNA encoding
α1A but not to DNA encoding α1B and, thus, can be used to
isolate DNA that encodes α1A subunits.

DNA encoding α1E subunits of a human calcium channel is
also provided. This DNA includes DNA that encodes an α1E
splice variant designated α1E-1 encoded by the DNA set forth in
SEQ ID No. 24, and a variant designated α1E-3 encoded by SEQ ID
No. 25. This DNA also includes other splice variants thereof
that encode sequences of amino acids encoded by all or a
portion of the sequences of nucleotides set forth in SEQ ID
Nos. 24 and 25 and DNA that hybridizes under conditions of
high stringency to the DNA of SEQ ID No. 24 or 25 and that
encodes an α1E splice variant.

DNA encoding α2 subunits of a human calcium channel, and
RNA encoding such subunits, made upon transcription of such
DNA, are provided. DNA encoding splice variants of the α2
subunit, including tissue-specific splice variants, is also
provided. In particular, DNA encoding the α2a-α2e subunit
subtypes is provided. In particularly preferred embodiments,
the DNA encoding the α2 subunit that is produced by
alternative processing of a primary transcript that includes
DNA encoding the amino acids set forth in SEQ ID No. 11 and the
DNA of SEQ ID No. 13 inserted between nucleotides 1624 and
1625 of SEQ ID No. 11 is provided. The DNA and amino acid
sequences of α2a-α2e are set forth in SEQ ID Nos. 11 and
29-32, respectively.

Isolated and purified DNA fragments encoding human
calcium channel β subunits, including DNA encoding β1, β2, β3
and β4 subunits, and splice variants of the β subunits are
provided. RNA encoding β subunits, made upon transcription of
the DNA, is also provided.

DNA encoding a β1 subunit that is produced by alternative
processing of a primary transcript that includes DNA encoding
the amino acids set forth in SEQ ID No. 9, but including the
DNA set forth in SEQ ID No. 12 inserted in place of
nucleotides 615-781 of SEQ ID No. 9 is also provided. DNA
encoding β1 subunits that are encoded by transcripts that have
the sequence set forth in SEQ ID No. 9 including the DNA set
forth in SEQ ID No. 12 inserted in place of nucleotides
615-781 of SEQ ID No. 9, but that lack one or more of the
following sequences of nucleotides: nucleotides 14-34 of SEQ
ID No. 12, nucleotides 13-34 of SEQ ID No. 12, nucleotides
35-55 of SEQ ID No. 12, nucleotides 56-190 of SEQ ID No. 12 and
nucleotides 191-271 of SEQ ID No. 12 are also provided. In
particular, β1 subunit splice variants β1-1 through β1-5 (see SEQ
ID Nos. 9, 10 and 33-35) described below, are provided.
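
The coordinate language used for these splice variants ("inserted in place of nucleotides 615-781", "inserted between nucleotides 1624 and 1625") can be illustrated with a short sketch. The Python below is only an illustration: it assumes 1-based, inclusive nucleotide numbering, and the function names and toy sequences are hypothetical and are not taken from any SEQ ID No.

```python
def replace_region(seq: str, start: int, end: int, insert: str) -> str:
    """Replace nucleotides start..end (1-based, inclusive) of seq with insert."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates out of range")
    return seq[:start - 1] + insert + seq[end:]


def insert_between(seq: str, left: int, insert: str) -> str:
    """Insert a segment between nucleotides left and left + 1 (1-based)."""
    if not (0 <= left <= len(seq)):
        raise ValueError("coordinate out of range")
    return seq[:left] + insert + seq[left:]


# Toy sequences only (assumption): they stand in for a primary transcript
# and an alternative exon, and are not real calcium channel sequences.
primary = "ATGGCCAAATTTGGGCCC"
variant = replace_region(primary, 7, 12, "CTCTCT")  # swap out positions 7-12
longer = insert_between(primary, 6, "GAGA")         # add between 6 and 7
print(variant)
print(longer)
```

Under the same numbering assumption, replacing a 167-nucleotide region (615-781) with a 271-nucleotide segment lengthens the sequence by 104 nucleotides.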

β2 subunit splice variants β2C-β2E that include all or a
portion of SEQ ID Nos. 26, 29 and 30 are provided; β3 subunit
splice variants, including β3 subunit splice variants that
have the sequences set forth in SEQ ID Nos. 19 and 20, and DNA
encoding the β4 subunit that includes DNA having the sequence
set forth in SEQ ID No. 27 and the amino acid sequence set
forth in SEQ ID No. 28 are provided.

Also, Escherichia coli (E. coli) host cells harboring
plasmids containing DNA encoding β3 have been deposited in
accord with the Budapest Treaty under Accession No. 69048 at 
the American Type Culture Collection. The deposited clone 
encompasses nucleotides 122-457 in SEQ ID No. 19 and 107-443 
in SEQ ID No. 20. 

DNA encoding β subunits that are produced by alternative
processing of a primary transcript encoding a β subunit,
including a transcript that includes DNA encoding the amino
acids set forth in SEQ ID No. 9 or including a primary
transcript that encodes β3 as deposited under ATCC Accession
No. 69048, but lacking and including alternative exons are
provided or may be constructed from the DNA provided herein.

DNA encoding γ subunits of human calcium channels is also
provided. RNA encoding γ subunits, made upon transcription
of the DNA, is also provided. In particular, DNA containing
the sequence of nucleotides set forth in SEQ ID No. 14 is
provided.

Full-length DNA clones and corresponding RNA transcripts, 
encoding α1, including splice variants of α1A, α1D, α1B, α1C, and
α1E, α2 and β subunits, including β1-1 through β1-5, β2C, β2D, β2E,
β3-1 and β4 of human calcium channels are provided. Also provided are
DNA clones encoding substantial portions of certain α1C
subtype subunits and γ subunits of voltage-dependent human
calcium channels for the preparation of full-length DNA clones 
encoding the corresponding full-length subunits. Full-length 
clones may be readily obtained using the disclosed DNA as a 
probe as described herein. 

The α1A subunit, α1C subunit, α1E subunit and splice
variants thereof, the β2D, β2C and β2E subunits and β4 subunits
and nucleic acids encoding these subunits are of particular
interest herein.

Eukaryotic cells containing heterologous DNA encoding one 
or more calcium channel subunits, particularly human calcium 
channel subunits, or containing RNA transcripts of DNA clones 
encoding one or more of the subunits are provided. A single 
α1 subunit can form a channel. The requisite combination of
subunits for formation of active channels in selected cells,
however, can be determined empirically using the methods
herein. For example, if a selected α1 subtype or variant does
not form an active channel in a selected cell line, an
additional subunit or subunits can be added until an active
channel is formed.

In preferred embodiments, the cells contain DNA or RNA 
encoding a human α1 subunit, preferably at least an α1D, α1B, α1A
or α1E subunit. In more preferred embodiments, the cells
contain DNA or RNA encoding additional heterologous subunits,
including at least one β, α2 or γ subunit. In such
embodiments, eukaryotic cells stably or transiently
transfected with any combination of one, two, three or four of
the subunit-encoding DNA clones, such as DNA encoding any of
α1, α1 + β, α1 + β + α2, are provided.

The eukaryotic cells provided herein contain heterologous
DNA that encodes an α1 subunit or heterologous DNA that
encodes an α1 subunit and heterologous DNA that encodes a β
subunit. At least one subunit is selected from an α1A-1, α1A-2,
α1C-2, α1E-1, α1E-3, β2C, β2D, β2E, a β3-1, β3-2 subunit or a β4
subunit. In
preferred embodiments, the cells express such heterologous
calcium channel subunits and include one or more of the
subunits in membrane-spanning heterologous calcium channels.
In more preferred embodiments, the eukaryotic cells express
functional, heterologous calcium channels that are capable of
gating the passage of calcium channel-selective ions and/or
binding compounds that, at physiological concentrations,
modulate the activity of the heterologous calcium channel. In
certain embodiments, the heterologous calcium channels include
at least one heterologous calcium channel subunit. In most
preferred embodiments, the calcium channels that are expressed 
on the surface of the eukaryotic cells are composed 
substantially or entirely of subunits encoded by the 
heterologous DNA or RNA. In preferred embodiments, the 
heterologous calcium channels of such cells are 
distinguishable from any endogenous calcium channels of the 
host cell. Such cells provide a means to obtain homogeneous 
populations of calcium channels. Typically, the cells contain 
the selected calcium channel as the only heterologous ion 
channel expressed by the cell.

In certain embodiments the recombinant eukaryotic cells 
that contain the heterologous DNA encoding the calcium channel 
subunits are produced by transfection with DNA encoding one or
more of the subunits or are injected with RNA transcripts of
DNA encoding one or more of the calcium channel subunits. The
DNA may be introduced as a linear DNA fragment or may be
included in an expression vector for stable or transient
expression of the subunit-encoding DNA. Vectors containing
DNA encoding human calcium channel subunits are also provided.

The eukaryotic cells that express heterologous calcium
channels may be used in assays for calcium channel function 
or, in the case of cells transformed with fewer subunit- 
encoding nucleic acids than necessary to constitute a 
functional recombinant human calcium channel, such cells may 
be used to assess the effects of additional subunits on 
calcium channel activity. The additional subunits can be 
provided by subsequently transfecting such a cell with one or 
more DNA clones or RNA transcripts encoding human calcium 
channel subunits.

The recombinant eukaryotic cells that express
membrane-spanning heterologous calcium channels may be used in methods
for identifying compounds that modulate calcium channel
activity. In particular, the cells are used in assays that
identify agonists and antagonists of calcium channel activity
in humans and/or that assess the contribution of the various
calcium channel subunits to the transport and regulation of 
transport of calcium ions. Because the cells constitute 
homogeneous populations of calcium channels, they provide a 
means to identify agonists or antagonists of calcium channel 
activity that are specific for each such population. 

The assays that use the eukaryotic cells for identifying 
compounds that modulate calcium channel activity are also 
provided. In practicing these assays, the eukaryotic cell that
expresses a heterologous calcium channel, containing at least
one subunit encoded by the DNA provided herein, is in a
solution containing a test compound and a calcium
channel-selective ion, the cell membrane is depolarized, and current
flowing into the cell is detected. If the test compound is
one that modulates calcium channel activity, the current that
is detected is different from that produced by depolarizing
the same or a substantially identical cell in the presence of
the same calcium channel-selective ion but in the absence of
the compound. In preferred embodiments, prior to the
depolarization step, the cell is maintained at a holding
potential which substantially inactivates calcium channels
which are endogenous to the cell. Also in preferred
embodiments, the cells are mammalian cells, most preferably
HEK cells, or amphibian oocytes.
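
The comparison at the heart of this assay (current with the test compound versus current without it) can be sketched in a few lines. The Python below is a minimal illustration under stated assumptions: inward current is represented as negative values, the traces are invented numbers, and the 20% cutoff and all function names are hypothetical choices that do not come from the text.

```python
from typing import Sequence


def peak_inward_current(trace: Sequence[float]) -> float:
    """Return the largest inward (most negative) current in a trace."""
    return min(trace)


def fractional_change(test: Sequence[float], control: Sequence[float]) -> float:
    """Change in peak inward current relative to the compound-free control."""
    control_peak = peak_inward_current(control)
    if control_peak == 0:
        raise ValueError("control trace shows no inward current")
    return (peak_inward_current(test) - control_peak) / control_peak


def modulates(test: Sequence[float], control: Sequence[float],
              threshold: float = 0.2) -> bool:
    """Flag a compound when the peak current differs by more than threshold.

    The 20% default is an arbitrary illustrative cutoff (assumption).
    """
    return abs(fractional_change(test, control)) > threshold


# Invented example traces (assumption), arbitrary current units.
control = [0.0, -50.0, -100.0, -60.0, -10.0]
with_compound = [0.0, -20.0, -40.0, -25.0, -5.0]
print(fractional_change(with_compound, control))
print(modulates(with_compound, control))
```

A negative fractional change here means the peak inward current was reduced in the presence of the compound, as would be expected of an antagonist; a positive value would suggest an agonist.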

Nucleic acid probes, typically labeled for detection,
containing at least about 14, preferably 16, or, if desired,
20 or 30 or more, contiguous nucleotides of α1D, α1C, α1B, α1A
and α1E, α2, β, including β1, β2, β3 and β4 splice variants, and
γ subunit-encoding DNA are provided. Methods using the probes
for the isolation and cloning of calcium channel
subunit-encoding DNA, including splice variants within tissues and
inter-tissue variants, are also provided.
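
The "contiguous nucleotides" criterion for a probe can be checked mechanically. The following Python is a hedged sketch: the function names and the toy target sequence are hypothetical, the default window of 14 follows the minimum length stated above, and the check considers only the given strand (it does not test the reverse complement or model hybridization stringency).

```python
def contiguous_windows(seq: str, k: int):
    """Yield every run of k contiguous nucleotides in seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def contains_contiguous_run(probe: str, target: str, k: int = 14) -> bool:
    """True if probe holds at least k contiguous nucleotides of target."""
    if len(probe) < k:
        return False
    target_windows = set(contiguous_windows(target.upper(), k))
    return any(w in target_windows
               for w in contiguous_windows(probe.upper(), k))


# Toy target (assumption): not a real subunit-encoding sequence.
target = "ATGGCTAGCTAGGATCCGATCGATCGTTAA"
probe = target[5:21]          # 16 contiguous nucleotides of the target
print(contains_contiguous_run(probe, target))
print(contains_contiguous_run(target[:10], target))
```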

Purified human calcium channel subunits and purified 
human calcium channels are provided. The subunits and 
channels can be isolated from a eukaryotic cell transfected 
with DNA that encodes the subunit. 

In another embodiment, immunoglobulins or antibodies 
obtained from the serum of an animal immunized with a 
substantially pure preparation of a human calcium channel, 
human calcium channel subunit or epitope-containing fragment
of a human calcium channel subunit are provided. Monoclonal
antibodies produced using a human calcium channel, human
calcium channel subunit or epitope-containing fragment thereof
as an immunogen are also provided. E. coli fusion proteins 
including a fragment of a human calcium channel subunit may 
also be used as immunogen. Such fusion proteins may contain 
a bacterial protein or portion thereof, such as the E. coli 
TrpE protein, fused to a calcium channel subunit peptide. The 
immunoglobulins that are produced using the calcium channel 
subunits or purified calcium channels as immunogens have, 
among other properties, the ability to specifically and 
preferentially bind to and/or cause the immunoprecipitation of 
a human calcium channel or a subunit thereof which may be 
present in a biological sample or a solution derived from such 
a biological sample. Such antibodies may also be used to 
selectively isolate cells that express calcium channels that 
contain the subunit for which the antibodies are specific. 

Methods for modulating the activity of ion channels by
contacting the calcium channels with an effective amount of 
the above-described antibodies are also provided. 

A diagnostic method for determining the presence of
Lambert-Eaton Syndrome (LES) in a human based on immunological
reactivity of LES immunoglobulin G (IgG) with a human calcium
channel subunit or a eukaryotic cell which expresses a
recombinant human calcium channel or a subunit thereof is also
provided. In particular, an immunoassay method for diagnosing
Lambert-Eaton Syndrome in a person by combining serum or an
IgG fraction from the person (test serum) with calcium channel
proteins, including the α and β subunits, and ascertaining
whether antibodies in the test serum react with one or more of 
the subunits, or a recombinant cell which expresses one or 
more of the subunits to a greater extent than antibodies in 
control serum, obtained from a person or group of persons 
known to be free of the Syndrome, is provided. Any 
immunoassay procedure known in the art for detecting 
antibodies against a given antigen in serum can be employed in 
the method. 

DETAILED DESCRIPTION OF THE INVENTION
Definitions:

Unless defined otherwise, all technical and scientific 
terms used herein have the same meaning as is commonly 
understood by one of skill in the art to which this invention 
belongs. All patents and publications referred to herein are 
incorporated by reference herein. 

Reference to each of the calcium channel subunits 
includes the subunits that are specifically disclosed herein 
and human calcium channel subunits encoded by DNA that can be 
isolated by using the DNA disclosed as probes and screening an 
appropriate human cDNA or genomic library under at least low 
stringency. Such DNA also includes DNA that encodes proteins
that have about 40% homology to any of the subunit proteins
described herein or DNA that hybridizes under conditions of at
least low stringency to the DNA provided herein and the
protein encoded by such DNA exhibits additional identifying
characteristics, such as function or molecular weight. 

It is understood that subunits that are encoded by 
transcripts that represent splice variants of the disclosed 
subunits or other such subunits may exhibit less than 40% 
overall homology to any single subunit, but will include 
regions of such homology to one or more such subunits. It is 
also understood that 40% homology refers to proteins that
share approximately 40% of their amino acids in common or that
share somewhat less, but include conservative amino acid 
substitutions, whereby the activity of the protein is not 
substantially altered. 
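
As a rough illustration of this definition, the share of amino acids two proteins have in common can be computed from a pairwise alignment. The Python sketch below makes several assumptions that are not part of the text: the two sequences are already aligned to equal length with '-' marking gaps, the percentage is taken over all alignment columns, and the conservative-substitution groups are one commonly used grouping chosen for illustration. The peptide strings are invented.

```python
# One common grouping of chemically similar residues (assumption).
CONSERVATIVE_GROUPS = [set("AVLIM"), set("FYW"), set("KRH"),
                       set("DE"), set("ST"), set("NQ")]


def percent_shared(a: str, b: str, count_conservative: bool = False) -> float:
    """Percent of aligned positions that match in two equal-length sequences.

    Gap characters ('-') never count as a match. When count_conservative
    is True, substitutions within one group above also count.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        raise ValueError("empty alignment")
    shared = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue
        if x == y:
            shared += 1
        elif count_conservative and any(x in g and y in g
                                        for g in CONSERVATIVE_GROUPS):
            shared += 1
    return 100.0 * shared / len(a)


# Invented peptides (assumption), already aligned.
seq_a = "MKVLDESTAG"
seq_b = "MRVIDQSAAG"
print(percent_shared(seq_a, seq_b))
print(percent_shared(seq_a, seq_b, count_conservative=True))
```

In this toy pair, counting the two conservative substitutions raises the shared fraction from 60% to 80%, which mirrors the statement that proteins sharing "somewhat less" than 40% identical residues may still qualify.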

As used herein, the α1 subunit types, encoded by
different genes, are designated as type α1A, α1B, α1C, α1D and
α1E. These types have also been referred to as VDCC IV for α1B,
VDCC II for α1C and VDCC III for α1D. Subunit subtypes, which
are splice variants, are referred to, for example, as α1B-1,
α1B-2, α1C-1 etc.

Thus, as used herein, DNA encoding the α1 subunit refers
to DNA that hybridizes to the DNA provided herein under
conditions of at least low stringency or encodes a subunit
that has at least about 40% homology to protein encoded by DNA
disclosed herein that encodes an α1 subunit of a human
calcium channel. An α1 subunit may be identified by its ability
to form a calcium channel. Typically, α1 subunits have
molecular masses greater than about 120 kD. Also,
hydropathy plots of deduced α1 subunit amino acid sequences
indicate that the α1 subunits contain four internal repeats,
each containing six putative transmembrane domains.

The activity of a calcium channel may be assessed in 
vitro by methods known to those of skill in the art, including 
the electrophysiological and other methods described herein. 
Typically, α1 subunits include regions with which one or more
modulators of calcium channel activity, such as a 1,4-DHP or
ω-CgTx, interact directly or indirectly. Types of α1 subunits
may be distinguished by any method known to those of skill in
the art, including on the basis of binding specificity. For
example, it has been found herein that α1B subunits participate
in the formation of channels that have previously been referred
to as N-type channels, α1D subunits participate in the
formation of channels that had previously been referred to as
L-type channels, and α1A subunits appear to participate in the
formation of channels that exhibit characteristics typical of
channels that had previously been designated P-type channels.
Thus, for example, the activity of channels that contain the
α1B subunit is insensitive to 1,4-DHPs, whereas the activity
of channels that contain the α1D subunit is modulated or
altered by a 1,4-DHP. It is presently preferable to refer to
calcium channels based on pharmacological characteristics and
current kinetics and to avoid historical designations. Types
and subtypes of α1 subunits may be characterized on the basis
of the effects of such modulators on the subunit or a channel
containing the subunit as well as differences in currents and
current kinetics produced by calcium channels containing the
subunit.

As used herein, an α2 subunit is encoded by DNA that
hybridizes to the DNA provided herein under conditions of low
stringency or encodes a protein that has at least about 40%
homology with that disclosed herein. Such DNA encodes a
protein that typically has a molecular mass greater than about
120 kD, but does not form a calcium channel in the absence of
an α1 subunit, and may alter the activity of a calcium channel
that contains an α1 subunit. Subtypes of the α2 subunit that
arise as splice variants are designated by lower case letter,
such as α2a, ... α2e. In addition, the α2 subunit and the
large fragment produced when the protein is subjected to
reducing conditions appear to be glycosylated with at least
N-linked sugars and do not specifically bind to the 1,4-DHPs
and phenylalkylamines that specifically bind to the α1
subunit. The smaller fragment, the C-terminal fragment, is
referred to as the δ subunit and includes amino acids from
about 946 (SEQ ID No. 11) through about the C-terminus. This
fragment may dissociate from the remaining portion of α2 when
the α2 subunit is exposed to reducing conditions.

As used herein, a β subunit is encoded by DNA that
hybridizes to the DNA provided herein under conditions of low
stringency or encodes a protein that has at least about 40%
homology with that disclosed herein and is a protein that
typically has a molecular mass lower than the α subunits and
on the order of about 50-80 kD, does not form a detectable
calcium channel in the absence of an α1 subunit, but may alter
the activity of a calcium channel that contains an α1 subunit
or that contains an α1 and α2 subunit.

Types of the β subunit that are encoded by different
genes are designated with subscripts, such as β1, β2, β3 and β4.
Subtypes of β subunits that arise as splice variants of a
particular type are designated with a numerical subscript
referring to the type and to the variant. Such subtypes
include, but are not limited to, the β1 splice variants,
including β1-1 through β1-5, and β2 variants, including β2C-β2E.

As used herein, a γ subunit is a subunit encoded by DNA
disclosed herein as encoding the γ subunit and may be isolated
and identified using the DNA disclosed herein as a probe by
hybridization or other such method known to those of skill in
the art, whereby full-length clones encoding a γ subunit may
be isolated or constructed. A γ subunit will be encoded by
DNA that hybridizes to the DNA provided herein under
conditions of low stringency or exhibits sufficient sequence
homology to encode a protein that has at least about 40%
homology with the γ subunit described herein.

Thus, one of skill in the art, in light of the disclosure 
herein, can identify DNA encoding α1, α2, β, δ and γ calcium 
channel subunits, including types encoded by different genes 
and subtypes that represent splice variants. For example, 
DNA probes based on the DNA disclosed herein may be used to 
screen an appropriate library, including a genomic or cDNA 
library, for hybridization to the probe and obtain DNA in one 
or more clones that includes an open reading frame that 
encodes an entire protein. Subsequent to screening an 
appropriate library with the DNA disclosed herein, the 
isolated DNA can be examined for the presence of an open 
reading frame from which the sequence of the encoded protein 
may be deduced. Determination of the molecular weight and 
comparison with the sequences herein should reveal the 
identity of the subunit as an α1, α2, etc. subunit. Functional 
assays may, if necessary, be used to determine whether the 
subunit is an α1, α2 or β subunit. 
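
The examination of isolated DNA for an open reading frame, and the deduction of the encoded protein from it, can be illustrated with a minimal Python sketch. This is an illustration under stated assumptions, not the procedure used herein: the function name `longest_orf` is hypothetical, only the given strand is scanned, the standard genetic code is assumed, and an open reading frame is taken to run from the first ATG in a frame to the next in-frame stop codon.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumption: no alternative code applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def longest_orf(dna):
    """Return (start, end, protein) for the longest ATG-to-stop ORF on the given strand.

    start is the 0-based index of the ATG, end is the index just past the stop
    codon, and protein is the deduced sequence without the stop.
    """
    dna = dna.upper()
    best = (0, 0, "")
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens a candidate ORF
            elif start is not None and CODON_TABLE.get(codon, "X") == "*":
                if i + 3 - start > best[1] - best[0]:
                    protein = "".join(
                        CODON_TABLE.get(dna[j:j + 3], "X") for j in range(start, i, 3)
                    )
                    best = (start, i + 3, protein)
                start = None  # look for the next ORF in this frame
    return best
```

A deduced protein obtained this way could then be compared, by length and sequence, with the subunit sequences provided herein; a candidate lacking an in-frame stop codon would not be reported by this sketch and would suggest a partial clone.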

For example, DNA encoding an α1A subunit may be isolated 
by screening an appropriate library with DNA encoding all or 
a portion of the human α1A subunit. Such DNA includes the DNA 
in the phage deposited under ATCC Accession No. 75293 that 
encodes a portion of an α1 subunit. DNA encoding an α1A 
subunit may be obtained from an appropriate library by screening 
with an oligonucleotide having all or a portion of the 
sequence set forth in SEQ ID No. 21, 22 and/or 23 or with the 
DNA in the deposited phage. Alternatively, such DNA may have 
a sequence that encodes an α1A subunit that is encoded by SEQ 
ID No. 22 or 23. 

Similarly, DNA encoding β3 may be isolated by screening 
a human cDNA library with DNA probes prepared from the plasmid 
β1.42 deposited under ATCC Accession No. 69048 or obtained 
from an appropriate library using probes having sequences 
prepared according to the sequences set forth in SEQ ID Nos. 
19 and/or 20. Also, DNA encoding β4 may be isolated by 
screening a human cDNA library with DNA probes prepared 
according to DNA set forth in SEQ ID No. 27, which sets forth 
the DNA sequence of a clone encoding a β4 subunit. The amino 
acid sequence is set forth in SEQ ID No. 28. Any method known 
to those of skill in the art for isolation and identification 
of DNA and preparation of full-length genomic or cDNA clones, 
including methods exemplified herein, may be used. DNA 
encoding 

The subunit encoded by isolated DNA may be identified by 
comparison with the DNA and amino acid sequences of the 
subunits provided herein. Splice variants share extensive 
regions of homology, but include non-homologous regions, 
whereas subunits encoded by different genes share a uniform 
distribution of non-homologous sequences. 

As used herein, a splice variant refers to a variant 
produced by differential processing of a primary transcript of 
genomic DNA that results in more than one type of mRNA. 
Splice variants may occur within a single tissue type or among 
tissues (tissue-specific variants). Thus, cDNA clones that 
encode calcium channel subunit subtypes that have regions of 
identical amino acids and regions of different amino acid 
sequences are referred to herein as "splice variants". 

As used herein, a "calcium channel-selective ion" is an 
ion that is capable of flowing through, or being blocked from 
flowing through, a calcium channel which spans a cellular 
membrane under conditions which would substantially similarly 
permit or block the flow of Ca²⁺. Ba²⁺ is an example of an ion 
which is a calcium channel-selective ion. 

As used herein, a compound that modulates calcium channel 
activity is one that affects the ability of the calcium 
channel to pass calcium channel-selective ions or affects 
other detectable calcium channel features, such as current 
kinetics. Such compounds include calcium channel antagonists 
and agonists and compounds that exert their effect on the 
activity of the calcium channel directly or indirectly. 

As used herein, a "substantially pure" subunit or protein 
is a subunit or protein that is sufficiently free of other 
polypeptide contaminants to appear homogeneous by SDS-PAGE or 
to be unambiguously sequenced. 

As used herein, selectively hybridize means that a DNA 
fragment hybridizes to a second fragment with sufficient 
specificity to permit the second fragment to be identified or 
isolated from among a plurality of fragments. In general, 
selective hybridization occurs at conditions of high 
stringency. 

As used herein, heterologous or foreign DNA and RNA are 
used interchangeably and refer to DNA or RNA that does not 
occur naturally as part of the genome in which it is present 
or which is found in a location or locations in the genome 
that differ from that in which it occurs in nature. It is DNA 
or RNA that is not endogenous to the cell and has been 
artificially introduced into the cell. Examples of 
heterologous DNA include, but are not limited to, DNA that 
encodes a calcium channel subunit and DNA that encodes RNA or 
proteins that mediate or alter expression of endogenous DNA by 
affecting transcription, translation, or other regulatable 
biochemical processes. The cell that expresses the 
heterologous DNA, such as DNA encoding a calcium channel 
subunit, may contain DNA encoding the same or different 
calcium channel subunits. The heterologous DNA need not be 
expressed and may be introduced in a manner such that it is 
integrated into the host cell genome or is maintained 
episomally. 

As used herein, operative linkage of heterologous DNA to 
regulatory and effector sequences of nucleotides, such as 
promoters, enhancers, transcriptional and translational stop 
sites, and other signal sequences, refers to the functional 
relationship between such DNA and such sequences of 
nucleotides. For example, operative linkage of heterologous 
DNA to a promoter refers to the physical and functional 
relationship between the DNA and the promoter such that the 
transcription of such DNA is initiated from the promoter by an 
RNA polymerase that specifically recognizes, binds to and 
transcribes the DNA in reading frame. 

As used herein, isolated, substantially pure DNA refers 
to DNA fragments purified according to standard techniques 
employed by those skilled in the art [see, e.g., Maniatis et 
al. (1982) Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY]. 

As used herein, expression refers to the process by which 
nucleic acid is transcribed into mRNA and translated into 
peptides, polypeptides, or proteins. If the nucleic acid is 
derived from genomic DNA, expression may, if an appropriate 
eukaryotic host cell or organism is selected, include splicing 
of the mRNA. 

As used herein, vector or plasmid refers to discrete 
elements that are used to introduce heterologous DNA into 
cells for either expression of the heterologous DNA or for 
replication of the cloned heterologous DNA. Selection and use 
of such vectors and plasmids are well within the level of 
skill of the art. 

As used herein, expression vector includes vectors 
capable of expressing DNA fragments that are in operative 
linkage with regulatory sequences, such as promoter regions, 
that are capable of effecting expression of such DNA 
fragments. Thus, an expression vector refers to a recombinant 
DNA or RNA construct, such as a plasmid, a phage, recombinant 
virus or other vector that, upon introduction into an 
appropriate host cell, results in expression of the cloned 
DNA. Appropriate expression vectors are well known to those 
of skill in the art and include those that are replicable in 
eukaryotic cells and/or prokaryotic cells and those that 
remain episomal or may integrate into the host cell genome. 

As used herein, a promoter region refers to the portion 
of DNA of a gene that controls transcription of DNA to which 
it is operatively linked. The promoter region includes 
specific sequences of DNA that are sufficient for RNA 
polymerase recognition, binding and transcription initiation. 
This portion of the promoter region is referred to as the 
promoter. In addition, the promoter region includes sequences 
that modulate this recognition, binding and transcription 
initiation activity of the RNA polymerase. These sequences 
may be cis acting or may be responsive to trans acting 
factors. Promoters, depending upon the nature of the 
regulation, may be constitutive or regulated. 

As used herein, a recombinant eukaryotic cell is a 
eukaryotic cell that contains heterologous DNA or RNA. 

As used herein, a recombinant or heterologous calcium 
channel refers to a calcium channel that contains one or more 
subunits that are encoded by heterologous DNA that has been 
introduced into and expressed in a eukaryotic cell that 
expresses the recombinant calcium channel. A recombinant 
calcium channel may also include subunits that are produced by 
DNA endogenous to the cell. In certain embodiments, the 
recombinant or heterologous calcium channel may contain only 
subunits that are encoded by heterologous DNA. 

As used herein, "functional" with respect to a 
recombinant or heterologous calcium channel means that the 
channel is able to provide for and regulate entry of calcium 
channel-selective ions, including, but not limited to, Ca²⁺ or 
Ba²⁺, in response to a stimulus and/or bind ligands with 
affinity for the channel. Preferably such calcium channel 
activity is distinguishable, such as by electrophysiological, 
pharmacological and other means known to those of skill in the 
art, from any endogenous calcium channel activity that is in the 
host cell. 

As used herein, a peptide having an amino acid sequence 
substantially as set forth in a particular SEQ ID No. 
includes peptides that have the same function but may include 
minor variations in sequence, such as conservative amino acid 
changes or minor deletions or insertions that do not alter the 
activity of the peptide. The activity of a calcium channel 
receptor subunit peptide refers to its ability to form 
functional calcium channels with other such subunits. 

As used herein, a physiological concentration of a 
compound is that which is necessary and sufficient for a 
biological process to occur. For example, a physiological 
concentration of a calcium channel-selective ion is a 
concentration of the calcium channel-selective ion necessary 
and sufficient to provide an inward current when the channels 
open. 

As used herein, activity of a calcium channel refers to 
the movement of a calcium channel-selective ion through a 
calcium channel. Such activity may be measured by any method 
known to those of skill in the art, including, but not limited 
to, measurement of the amount of current which flows through 
the recombinant channel in response to a stimulus. 

As used herein, a "functional assay" refers to an assay 
that identifies functional calcium channels. A functional 
assay, thus, is an assay to assess function. 

As understood by those skilled in the art, assay methods 
for identifying compounds, such as antagonists and agonists, 
that modulate calcium channel activity, generally require 
comparison to a control. One type of a "control" cell or 
"control" culture is a cell or culture that is treated 
substantially the same as the cell or culture exposed to the 
test compound except that the control culture is not exposed 
to the test compound. Another type of a "control" cell or 
"control" culture may be a cell or a culture of cells which 
are identical to the transfected cells except the cells 
employed for the control culture do not express functional 
calcium channels. In this situation, the response of the test 
cell to the test compound is compared to the response (or lack 
of response) of the calcium channel-negative cell to the test 
compound, when cells or cultures of each type of cell are 
exposed to substantially the same reaction conditions in the 
presence of the compound being assayed. For example, in 
methods that use patch clamp electrophysiological procedures, 
the same cell can be tested in the presence and absence of the 
test compound, by changing the external solution bathing the 
cell as known in the art. 

It is also understood that each of the subunits disclosed 
herein may be modified by making conservative amino acid 
substitutions and the resulting modified subunits are 
contemplated herein. Suitable conservative substitutions of 
amino acids are known to those of skill in this art and may be 
made generally without altering the biological activity of the 
resulting molecule. Those of skill in this art recognize 
that, in general, single amino acid substitutions in non-essential 
regions of a polypeptide do not substantially alter 
biological activity (see, e.g., Watson et al. Molecular 
Biology of the Gene, 4th Edition, 1987, The Benjamin/Cummings 
Pub. Co., p. 224). Such substitutions are preferably, although 
not exclusively, made in accordance with those set forth in 
TABLE 1 as follows: 






TABLE 1 

Original residue     Conservative substitution 
Ala (A)              Gly; Ser 
Arg (R)              Lys 
Asn (N)              Gln; His 
Cys (C)              Ser 
Gln (Q)              Asn 
Glu (E)              Asp 
Gly (G)              Ala; Pro 
His (H)              Asn; Gln 
Ile (I)              Leu; Val 
Leu (L)              Ile; Val 
Lys (K)              Arg; Gln; Glu 
Met (M)              Leu; Tyr; Ile 
Phe (F)              Met; Leu; Tyr 
Ser (S)              Thr 
Thr (T)              Ser 
Trp (W)              Tyr 
Tyr (Y)              Trp; Phe 
Val (V)              Ile; Leu 



Other substitutions are also permissible and may be 
determined empirically or in accord with known conservative 
substitutions. Any such modification of the polypeptide may 
be effected by any means known to those of skill in this art. 
Mutation may be effected by any method known to those of skill 
in the art, including site-specific or site-directed 
mutagenesis of DNA encoding the protein and the use of DNA 
amplification methods using primers to introduce and amplify 
alterations in the DNA template. 
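
The substitutions of TABLE 1 can be expressed as a simple lookup, which may be convenient when checking whether a proposed modification falls within the listed preferred substitutions. The following Python sketch is illustrative only; the names `TABLE_1` and `is_table1_substitution` are hypothetical, and the lookup is directional because TABLE 1 lists substitutions per original residue. As the text notes, other substitutions are also permissible, so a `False` result does not mean a substitution is excluded.

```python
# Preferred conservative substitutions, keyed by original residue (three-letter codes),
# transcribed from TABLE 1. Residues not listed in TABLE 1 (e.g. Asp, Pro) have no entry.
TABLE_1 = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"Lys"},
    "Asn": {"Gln", "His"},
    "Cys": {"Ser"},
    "Gln": {"Asn"},
    "Glu": {"Asp"},
    "Gly": {"Ala", "Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Tyr", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_table1_substitution(original, replacement):
    """Return True if replacing `original` with `replacement` is listed in TABLE 1."""
    return replacement in TABLE_1.get(original, set())
```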

Identification and isolation of DNA encoding human calcium 
channel subunits 

Methods for identifying and isolating DNA encoding α1, α2, 
β and γ subunits of human calcium channels are provided. 

Identification and isolation of such DNA may be 
accomplished by hybridizing restriction enzyme-digested human 
DNA, under appropriate conditions of at least low stringency 
whereby DNA that encodes the desired subunit is isolated, 
with a labeled probe having at least 14, preferably 16 or more, 
nucleotides and derived from any contiguous portion of DNA 
having a sequence of nucleotides set forth herein by sequence 
identification number. Once a hybridizing fragment is 
identified in the hybridization reaction, it can be cloned 
employing standard cloning techniques known to those of skill 
in the art. Full-length clones may be identified by the 
presence of a complete open reading frame and the identity of 
the encoded protein verified by sequence comparison with the 
subunits provided herein and by functional assays to assess 
calcium channel-forming ability or other function. This 
method can be used to identify genomic DNA encoding the 
subunit or cDNA encoding splice variants of human calcium 
channel subunits generated by alternative splicing of the 
primary transcript of genomic subunit DNA. For instance, DNA, 
cDNA or genomic DNA, encoding a calcium channel subunit may be 
identified by hybridization to a DNA probe and characterized 
by methods known to those of skill in the art, such as 
restriction mapping and DNA sequencing, and compared to the 
DNA provided herein in order to identify heterogeneity or 
divergence in the sequences of the DNA. Such sequence 
differences may indicate that the transcripts from which the 
cDNA was produced result from alternative splicing of a 
primary transcript, if the non-homologous and homologous 
regions are clustered, or from a different gene if the 
non-homologous regions are distributed throughout the cloned DNA. 
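
The distinction drawn above between clustered and distributed sequence differences can be made concrete with a small Python sketch. It is a heuristic illustration only, not a method disclosed herein: it assumes the two sequences have already been aligned to equal length, the function name `divergent_blocks` is hypothetical, and the `merge_gap` threshold is an arbitrary choice.

```python
def divergent_blocks(aligned_a, aligned_b, merge_gap=10):
    """Group mismatched positions of two aligned sequences into blocks.

    Mismatches separated by at most `merge_gap` identical positions are merged
    into one block. Returns a list of (start, end) spans, end exclusive.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    blocks = []
    for pos, (a, b) in enumerate(zip(aligned_a, aligned_b)):
        if a == b:
            continue
        if blocks and pos - blocks[-1][1] <= merge_gap:
            blocks[-1][1] = pos + 1      # extend the current divergent block
        else:
            blocks.append([pos, pos + 1])  # start a new divergent block
    return [tuple(block) for block in blocks]
```

Under this sketch, a pair of cDNAs whose differences collapse into one or a few blocks would be consistent with alternative splicing of a single primary transcript, while many small blocks spread along the alignment would be more consistent with transcripts of different genes.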

Any suitable method for isolating genes using the DNA 
provided herein may be used. For example, oligonucleotides 
corresponding to regions of sequence differences have been 
used to isolate, by hybridization, DNA encoding the full- 
length splice variant and can be used to isolate genomic 
clones. A probe, based on a nucleotide sequence disclosed 
herein, which encodes at least a portion of a subunit of a 
human calcium channel, such as a tissue-specific exon, may be 
used as a probe to clone related DNA, to clone a full-length 
cDNA clone or genomic clone encoding the human calcium channel 
subunit. 

Labeled, including, but not limited to, radioactively or 
enzymatically labeled, RNA or single-stranded DNA of at least 
14 substantially contiguous bases, preferably 16 or more, 
generally at least 30 contiguous bases of a nucleic acid which 
encodes at least a portion of a human calcium channel subunit, 
the sequence of which nucleic acid corresponds to a segment of 
a nucleic acid sequence disclosed herein by reference to a SEQ 
ID No. are provided. Such nucleic acid segments may be used 
as probes in the methods provided herein for cloning DNA 
encoding calcium channel subunits. See, generally, Sambrook 
et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd 
Edition, Cold Spring Harbor Laboratory Press. 

In addition, nucleic acid amplification techniques, which 
are well known in the art, can be used to locate splice 
variants of calcium channel subunits by employing 
oligonucleotides based on DNA sequences surrounding the 
divergent sequence as primers for amplifying human RNA or genomic 
DNA. Size and sequence determinations of the amplification 
products can reveal splice variants. Furthermore, isolation 
of human genomic DNA sequences by hybridization can yield DNA 
containing multiple exons, separated by introns, that 
correspond to different splice variants of transcripts 
encoding human calcium channel subunits. 

DNA encoding types and subtypes of each of the α1, α2, β 
and γ subunits of voltage-dependent human calcium channels has 
been cloned herein by nucleic acid amplification of cDNA from 
selected tissues or by screening human cDNA libraries prepared 
from isolated poly A+ mRNA from cell lines or tissue of human 
origin having such calcium channels. Among the sources of 
such cells or tissue for obtaining mRNA are human brain tissue 
or a human cell line of neural origin, such as a neuroblastoma 
cell line, human skeletal muscle or smooth muscle cells, and 
the like. Methods of preparing cDNA libraries are well known 
in the art [see generally Ausubel et al. (1987) Current 
Protocols in Molecular Biology, Wiley-Interscience, New York; 
and Davis et al. (1986) Basic Methods in Molecular Biology, 
Elsevier Science Publishing Co., New York]. 

Preferred regions from which to construct probes include 
5' and/or 3' coding sequences, sequences predicted to encode 
transmembrane domains, sequences predicted to encode 
cytoplasmic loops, signal sequences, ligand-binding sites, and 
other functionally significant sequences (see Table, below). 
Either the full-length subunit -encoding DNA or fragments 
thereof can be used as probes, preferably labeled with 
suitable label means for ready detection. When fragments are 
used as probes, the DNA sequences will typically be 
from the carboxyl-end-encoding portion of the DNA, and most 
preferably will include predicted transmembrane 
domain-encoding portions based on hydropathy analysis of the deduced 
amino acid sequence [see, e.g., Kyte and Doolittle (1982) J. 
Mol. Biol. 157:105]. 



Riboprobes that are specific for human calcium channel 
subunit types or subtypes have been prepared. These probes 
are useful for identifying expression of particular subunits 
in selected tissues and cells. The regions from which the 
probes were prepared were identified by comparing the DNA and 
amino acid sequences of all known α1 or β subunit subtypes. 
Regions of least homology, preferably human-derived sequences, 
and generally about 250 to about 600 nucleotides were 
selected. Numerous riboprobes for α and β subunits have been 
prepared; some of these are listed in the following Table. 

TABLE 2 
SUMMARY OF RNA PROBES 

SUBUNIT            NUCLEOTIDE     PROBE NAME         PROBE TYPE   ORIENTATION 
SPECIFICITY        POSITION 
α1A generic        3357-3840      pGEM7Zα1A*         riboprobe    n/a 
                   761-790        SE700              oligo        antisense 
                   3440-3464      SE718              oligo        antisense 
                   [illegible]    [illegible]        oligo        sense 
α1B generic        [illegible]    [illegible]        riboprobe    n/a 
                   [illegible]    pGEM7Zα1Bcooh      riboprobe    n/a 
α1B-1 specific     6490-6676      pCRIIα1B-1/187     riboprobe    n/a 
α1E generic        3114-3462      pGEM7Zα1E          riboprobe    n/a 
α2b                1321-1603      pCRIIα2b           riboprobe    n/a 
β generic (?)      212-236        SE300              oligo        antisense 
β1 generic         1267-1291      SE301              oligo        antisense 
β1-2 specific      1333-1362      SE17               oligo        antisense 
                   1682-1706      SE23               oligo        sense 
                   2742-2766      SE43               oligo        antisense 
                   27-56          SE208              oligo        antisense 
                   340-364        SE274              oligo        antisense 
                   340-364        SE275              oligo        sense 
β3 specific        1309-1509                         riboprobe    n/a 
β4 specific        1228-1560                         riboprobe    n/a 

* The pGEM series are available from Promega, Madison WI; see also 
U.S. Patent No. 4,766,072. 



The above-noted nucleotide regions are also useful in 
selecting regions of the protein for preparation of 
subunit-specific antibodies, discussed below. 
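
The selection of riboprobe regions described above, namely regions of least homology among subunit subtypes spanning about 250 to about 600 nucleotides, can be sketched as a sliding-window search. The Python below is a minimal illustration under stated assumptions: it compares only two sequences that have already been aligned to equal length, treats gap characters (`-`) as non-identical, and uses the hypothetical function name `least_homologous_window`. It is not the procedure by which the probes listed herein were chosen.

```python
def least_homologous_window(aligned_a, aligned_b, window):
    """Find the window with the lowest fraction of identical aligned positions.

    Returns (start, identity), where start is the 0-based window start and
    identity is the fraction of identical positions in that window. Ties are
    resolved in favor of the earliest window.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not 0 < window <= len(aligned_a):
        raise ValueError("window must be between 1 and the alignment length")
    # 1 where the aligned bases are identical and not gaps, else 0
    matches = [int(a == b and a != "-") for a, b in zip(aligned_a, aligned_b)]
    count = sum(matches[:window])
    best_start, best_count = 0, count
    for start in range(1, len(matches) - window + 1):
        # slide the window one position: add the entering base, drop the leaving one
        count += matches[start + window - 1] - matches[start - 1]
        if count < best_count:
            best_start, best_count = start, count
    return best_start, best_count / window
```

For probe design in the size range given above, `window` would be set somewhere between about 250 and about 600.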

The DNA clones and fragments thereof provided herein thus 
can be used to isolate genomic clones encoding each subunit 
and to isolate any splice variants by hybridization screening 
of libraries prepared from different human tissues. Nucleic 
acid amplification techniques, which are well known in the 
art, can also be used to locate DNA encoding splice variants 
of human calcium channel subunits. This is accomplished by 
employing oligonucleotides based on DNA sequences surrounding 
divergent sequence(s) as primers for amplifying human RNA or 
genomic DNA. Size and sequence determinations of the 
amplification products can reveal the existence of splice 
variants. Furthermore, isolation of human genomic DNA 
sequences by hybridization can yield DNA containing multiple 
exons, separated by introns, that correspond to different 
splice variants of transcripts encoding human calcium channel 
subunits. 

Once DNA encoding a calcium channel subunit is isolated, 
ribonuclease (RNase) protection assays can be employed to 
determine which tissues express mRNA encoding a particular 
calcium channel subunit or variant. These assays provide a 
sensitive means for detecting and quantitating an RNA species 
in a complex mixture of total cellular RNA. The subunit DNA 
is labeled and hybridized with cellular RNA. If complementary 
mRNA is present in the cellular RNA, a DNA-RNA hybrid results. 
The RNA sample is then treated with RNase, which degrades 
single-stranded RNA. Any RNA-DNA hybrids are protected from 
RNase degradation and can be visualized by gel electrophoresis 
and autoradiography. In situ hybridization techniques can 
also be used to determine which tissues express mRNA encoding 
a particular calcium channel subunit. The labeled subunit 
DNAs are hybridized to different tissue slices to visualize 
subunit mRNA expression. 

With respect to each of the respective subunits (α1, α2, 
β or γ) of human calcium channels, once the DNA encoding the 
channel subunit was identified by a nucleic acid screening 
method, the isolated clone was used for further screening to 
identify overlapping clones. Some of the cloned DNA fragments 
can be and have been subcloned into an appropriate vector such as 
pIBI24/25 (IBI, New Haven, CT), M13mp18/19, pGEM4, pGEM3, 
pGEM7Z, pSP72 and other such vectors known to those of skill 
in this art, and characterized by DNA sequencing and 
restriction enzyme mapping. A sequential series of 
overlapping clones may thus be generated for each of the 
subunits until a full-length clone can be prepared by methods, 
known to those of skill in the art, that include 
identification of translation initiation (start) and 
translation termination (stop) codons. For expression of the 
cloned DNA, the 5' noncoding region and other transcriptional 
and translational control regions of such a clone may be 
replaced with an efficient ribosome binding site and other 
regulatory regions as known in the art. Other modifications 
of the 5' end, known to those of skill in the art, that may be 
required to optimize translation and/or transcription 
efficiency may also be effected, if deemed necessary. 

Examples II-VIII, below, describe in detail the cloning 
of each of the various subunits of a human calcium channel as 
well as subtypes and splice variants, including tissue- 
specific variants thereof. In the few instances in which 
partial sequences of a subunit are disclosed, it is well 
within the skill of the art, in view of the teaching herein, 
to obtain the corresponding full-length clones and sequence 
thereof encoding the subunit, subtype or splice variant 
thereof using the methods described above and exemplified 
below. 

Identification and isolation of DNA encoding α1 
subunits 

A number of voltage-dependent calcium channel α1 subunit 
genes, which are expressed in the human CNS and in other 
tissues, have been identified and have been designated as α1A, 
α1B (or VDCC IV), α1C (or VDCC II), α1D (or VDCC III) and α1E. 
DNA, isolated from a human neural cDNA library, that encodes 
each of the subunit types has been isolated. DNA encoding 
subtypes of each of the types, which arise as splice variants, 
is also provided. Subtypes are herein designated, for 
example, as α1B-1, α1B-2. 

The α1 subunit types A, B, C, D and E of 
voltage-dependent calcium channels, and subtypes thereof, differ with 
respect to sensitivity to known classes of calcium channel 
agonists and antagonists, such as DHPs, phenylalkylamines, 
omega conotoxin (ω-CgTx), the funnel web spider toxin 
ω-Aga-IV, and pyrazonoylguanidines. These subunit types also appear 
to differ in the holding potential and in the kinetics of 
currents produced upon depolarization of cell membranes 
containing calcium channels that include different types of α1 
subunits. 

DNA that encodes an α1 subunit that binds to at least one 
compound selected from among dihydropyridines, 
phenylalkylamines, ω-CgTx, components of funnel web spider 
toxin, and pyrazonoylguanidines is provided. For example, the 
α1B subunit provided herein appears to specifically interact 
with ω-CgTx in N-type channels, and the α1D subunit provided 
herein specifically interacts with DHPs in L-type channels. 

Identification and isolation of DNA 
encoding the α1D human calcium channel 
subunit 

The α1D subunit cDNA has been isolated using fragments of 
the rabbit skeletal muscle calcium channel α1 subunit cDNA as 
a probe to screen a cDNA library of a human neuroblastoma cell 
line, IMR32, to obtain clone α1.36. This clone was used as a 
probe to screen additional IMR32 cell cDNA libraries to obtain 
overlapping clones, which were then employed for screening 
until a sufficient series of clones to span the length of the 
nucleotide sequence encoding the human α1D subunit was 
obtained. Full-length clones encoding α1D were constructed by 
ligating portions of partial α1D clones as described in Example 
II. SEQ ID No. 1 shows the 7,635 nucleotide sequence of the 
cDNA encoding the α1D subunit. There is a 6,483 nucleotide 
sequence reading frame which encodes a sequence of 2,161 amino 
acids (as set forth in SEQ ID No. 1). 

SEQ ID No. 2 provides the sequence of an alternative exon 
encoding the IS6 transmembrane domain [see Tanabe, T., et al. 
(1987) Nature 325:313-318 for a description of transmembrane 
domain terminology] of the α1D subunit. 

SEQ ID No. 1 also shows the 2,161 amino acid sequence 
deduced from the human neuronal calcium channel α1D subunit 
DNA. Based on the amino acid sequence, the α1D protein has a 
calculated Mr of 245,163. The α1D subunit of the calcium 
channel contains four putative internal repeated sequence 
regions. Four internally repeated regions represent 24 
putative transmembrane segments, and the amino- and 
carboxyl-termini extend intracellularly. 
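
The figures stated for the α1D subunit are mutually consistent, as the short Python check below illustrates. The constants are taken from the text; the function name `summarize_alpha1d` and the derived quantities are illustrative only. In particular, the count of nucleotides outside the stated reading frame is simply the difference between the two stated lengths and is not itself a figure given herein.

```python
# Figures stated in the text for the alpha-1D subunit (SEQ ID No. 1)
AMINO_ACIDS = 2161
READING_FRAME_NUCLEOTIDES = 6483
CDNA_NUCLEOTIDES = 7635
CALCULATED_MR = 245_163
REPEATED_REGIONS = 4
TRANSMEMBRANE_SEGMENTS = 24


def summarize_alpha1d():
    """Derive simple consistency checks from the stated alpha-1D figures."""
    return {
        # three nucleotides per codon: 2,161 codons should span 6,483 nucleotides
        "frame_matches_protein": AMINO_ACIDS * 3 == READING_FRAME_NUCLEOTIDES,
        # cDNA nucleotides lying outside the stated reading frame
        "nucleotides_outside_frame": CDNA_NUCLEOTIDES - READING_FRAME_NUCLEOTIDES,
        # average mass per residue implied by the calculated Mr
        "mean_residue_mass": CALCULATED_MR / AMINO_ACIDS,
        # putative transmembrane segments per internal repeat
        "segments_per_repeat": TRANSMEMBRANE_SEGMENTS // REPEATED_REGIONS,
    }
```

The reading frame length equals exactly three times the number of amino acids, and the 24 putative transmembrane segments correspond to six per repeated region.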

The α1D subunit has been shown to mediate DHP-sensitive, 
high-voltage-activated, long-lasting calcium channel activity. 
This calcium channel activity was detected when oocytes were 
co-injected with RNA transcripts encoding an α1D and β1-2 or α1D, 
α2b and β1-2 subunits. This activity was distinguished from Ba²⁺ 
currents detected when oocytes were injected with RNA 
transcripts encoding the β1-2 ± α2b subunits. These currents 
pharmacologically and biophysically resembled Ca²⁺ currents 
reported for uninjected oocytes. 

Identification and isolation of DNA 
encoding the α1A human calcium channel 
subunit 

Biological material containing DNA encoding a portion of 
the α1A subunit had been deposited in the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville, Maryland 20852 
U.S.A. under the terms of the Budapest Treaty on the 
International Recognition of Deposits of Microorganisms for 
Purposes of Patent Procedure and the Regulations promulgated 
under this Treaty. Samples of the deposited material are and 
will be available to industrial property offices and other 
persons legally entitled to receive them under the terms of 
the Treaty and Regulations and otherwise in compliance with 
the patent laws and regulations of the United States of 
America and all other nations or international organizations 
in which this application, or an application claiming priority 
of this application, is filed or in which any patent granted 
on any such application is granted. 

A portion of an α1A subunit is encoded by an approximately 
3 kb insert in λgt10 phage designated α1.254 in E. coli host 
strain NM514. A phage lysate of this material has been 
deposited at the American Type Culture Collection under 
ATCC Accession No. 75293, as described above. DNA encoding α1A 
may also be identified by screening with a probe prepared from 
DNA that has SEQ ID No. 21: 

5' CTCAGTACCATCTCTGATACCAGCCCCA 3'. 
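
As an illustration of working with such a probe sequence, the Python sketch below computes the complementary strand of SEQ ID No. 21 and compares its length with the probe lengths recited herein (at least 14, preferably 16 or more, generally at least 30 bases). The function names `reverse_complement` and `probe_length_class` are hypothetical, and the sketch makes no claim about which strand was actually used for screening.

```python
# SEQ ID No. 21 as printed in the text, written 5' to 3'
SEQ_ID_21 = "CTCAGTACCATCTCTGATACCAGCCCCA"

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(dna):
    """Return the complementary strand of `dna`, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def probe_length_class(dna):
    """Classify a probe against the lengths recited in the text."""
    n = len(dna)
    if n >= 30:
        return "generally used (at least 30 bases)"
    if n >= 16:
        return "preferred (16 or more bases)"
    if n >= 14:
        return "minimum (at least 14 bases)"
    return "shorter than the recited minimum"
```

Here `probe_length_class(SEQ_ID_21)` reports the preferred class, since the sequence as printed contains 28 bases.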

α1A splice variants have been obtained. The sequences of 
two α1A splice variants, α1A-1 and α1A-2, are set forth in SEQ ID 
Nos. 22 and 23. Other splice variants may be obtained by 
screening a human library as described above or using all or 
a portion of the sequences set forth in SEQ ID Nos. 22 and 23. 

Identification and isolation of DNA 
encoding the α1B human calcium channel 
subunit 

DNA encoding the α1B subunit was isolated by screening a 
human basal ganglia cDNA library with fragments of the rabbit 
skeletal muscle calcium channel α1 subunit-encoding cDNA. A 
portion of one of the positive clones was used to screen an 
IMR32 cell cDNA library. Clones that hybridized to the basal 
ganglia DNA probe were used to further screen an IMR32 cell 
cDNA library to identify overlapping clones that in turn were 
used to screen a human hippocampus cDNA library. In this way, 
a sufficient series of clones to span nearly the entire length 
of the nucleotide sequence encoding the human α1B subunit was 
obtained. Nucleic acid amplification of specific regions of 
the IMR32 cell α1B mRNA yielded additional segments of the α1B 
coding sequence. 

A full-length α1B DNA clone was constructed by ligating
portions of the partial cDNA clones as described in Example
II.C. SEQ ID Nos. 7 and 8 show the nucleotide sequences of
DNA clones encoding the α1B subunit as well as the deduced
amino acid sequences. The α1B subunit encoded by SEQ ID No.
7 is referred to as the α1B-1 subunit to distinguish it from
another α1B subunit, α1B-2, encoded by the nucleotide sequence
shown as SEQ ID No. 8, which is derived from alternative
splicing of the α1B subunit transcript.
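
The idea of spanning a coding sequence with a series of overlapping partial clones can be illustrated in silico. The sketch below is not the ligation procedure of Example II.C; it is a hypothetical sequence-level analogy that merges ordered, overlapping clone sequences into one contiguous sequence. The function names and the `min_overlap` threshold are assumptions made for illustration.

```python
def overlap_length(left: str, right: str, min_overlap: int = 20) -> int:
    """Length of the longest suffix of `left` that equals a prefix of `right`.

    Returns 0 when no overlap of at least `min_overlap` bases exists.
    """
    max_len = min(len(left), len(right))
    for k in range(max_len, min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return k
    return 0


def merge_clones(ordered_clones, min_overlap: int = 20) -> str:
    """Merge clone sequences, given in 5'-to-3' order, through their overlaps."""
    contig = ordered_clones[0]
    for clone in ordered_clones[1:]:
        k = overlap_length(contig, clone, min_overlap)
        if k == 0:
            raise ValueError("adjacent clones do not overlap sufficiently")
        contig += clone[k:]  # append only the part not already covered
    return contig
```

With short toy sequences and `min_overlap=4`, `merge_clones(["ATGCCGTA", "CGTATTGA", "TTGACC"], 4)` yields a single 14-base sequence in which each shared region appears once.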

Nucleic acid amplification of IMR32 cell mRNA using
oligonucleotide primers designed according to nucleotide
sequences within the α1B-encoding DNA has identified variants
of the α1B transcript that appear to be splice variants because
they contain divergent coding sequences.

Identification and isolation of DNA
encoding the α1C human calcium channel
subunit

Numerous α1C-specific DNA clones were isolated.
Characterization of the sequence revealed the α1C coding
sequence, the α1C translation initiation sequence, and an
alternatively spliced region of α1C. Alternatively spliced
variants of the α1C subunit have been identified. SEQ ID No.
3 sets forth DNA encoding a substantial portion of an α1C
subunit. The DNA sequences set forth in SEQ ID No. 4 and No.
5 encode two possible amino terminal ends of the α1C protein.
SEQ ID No. 6 encodes an alternative exon for the IV S3
transmembrane domain. The sequences of substantial
portions of two α1C splice variants, designated α1C-1 and α1C-2,
are set forth in SEQ ID Nos. 3 and 36, respectively.

The isolation and identification of DNA clones encoding
portions of the α1C subunit is described in detail in Example
II.

Identification and isolation of DNA
encoding the α1E human calcium channel
subunit

DNA encoding α1E human calcium channel subunits has been
isolated from an oligo dT-primed human hippocampus library.
The resulting clones, which are splice variants, were
designated α1E-1 and α1E-3. The subunit designated α1E-1 has the
amino acid sequence set forth in SEQ ID No. 24, and a subunit
designated α1E-3 has the amino acid sequence set forth in SEQ
ID No. 25. These splice variants differ by virtue of a 57 base
pair insert between nucleotides 2405 and 2406 of SEQ ID No.
24.
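
The arithmetic behind a 57 base pair insert can be checked with a short Python sketch. Because 57 is a multiple of 3, such an insert adds 19 codons without shifting the downstream reading frame, assuming the insert lies within the coding region and contains no stop codon (an assumption, not a statement from the text). The helper names are hypothetical, and the placeholder sequences used in any call are not the sequences of SEQ ID Nos. 24 or 25.

```python
INSERT_LENGTH_BP = 57          # from the text
INSERT_AFTER_NUCLEOTIDE = 2405  # insert lies between nucleotides 2405 and 2406


def preserves_reading_frame(insert_length: int) -> bool:
    """An insert keeps the downstream frame only if its length is divisible by 3."""
    return insert_length % 3 == 0


def insert_segment(seq: str, insert: str, after_nt: int) -> str:
    """Insert `insert` between 1-based nucleotides `after_nt` and `after_nt + 1`."""
    if not 0 <= after_nt <= len(seq):
        raise ValueError("insertion point lies outside the sequence")
    return seq[:after_nt] + insert + seq[after_nt:]


codons_added = INSERT_LENGTH_BP // 3  # number of whole codons contributed by the insert
```

Applied to placeholder strings, `insert_segment("A" * 2410, "C" * 57, 2405)` returns a 2467-character string in which the inserted block occupies 1-based positions 2406 through 2462.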

The α1E subunits provided herein appear to participate in
the formation of calcium channels that have properties of
high-voltage activated calcium channels and low-voltage
activated channels. These channels are rapidly inactivating
compared to other high voltage-activated calcium channels.
In addition, these channels exhibit pharmacological profiles
that are similar to voltage-activated channels, but are also
sensitive to DHPs and ω-Aga-IVA, which block certain high
voltage-activated channels. Additional details regarding the
electrophysiology and pharmacology of channels containing α1E
subunits are provided in Example VII.F.

Identification and isolation of DNA
encoding additional α1 human
calcium channel subunit types and
subtypes

DNA encoding additional α1 subunits can be isolated and
identified using the DNA provided herein as described for the
α1A, α1B, α1C, α1D and α1E subunits or using other methods known
to those of skill in the art. In particular, the DNA provided
herein may be used to screen appropriate libraries to isolate
related DNA. Full-length clones can be constructed using
methods, such as those described herein, and the resulting
subunits characterized by comparison of their sequences and
electrophysiological and pharmacological properties with the
subunits exemplified herein.

Identification and isolation of DNA encoding β
human calcium channel subunits

DNA encoding β1

To isolate DNA encoding the β1 subunit, a human
hippocampus cDNA library was screened by hybridization to a
DNA fragment encoding a rabbit skeletal muscle calcium channel
β subunit. A hybridizing clone was selected and was in turn
used to isolate overlapping clones until the overlapping
clones encompassing DNA encoding the entire human calcium
channel β subunit were isolated and sequenced.

Five alternatively spliced forms of the human calcium
channel β1 subunit have been identified and DNA encoding a
number of forms has been isolated. These forms are
designated β1-1, expressed in skeletal muscle, β1-2, expressed
in the CNS, β1-3, also expressed in the CNS, β1-4,
expressed in aorta tissue and HEK 293 cells, and β1-5,
expressed in HEK 293 cells. Full-length DNA clones encoding
the β1-2 and β1-3 subunits have been constructed. The subunits
β1-2, β1-4 and β1-5 have been identified by nucleic acid
amplification analysis as alternatively spliced forms of the
β subunit. Sequences of the β1 splice variants are set forth
in SEQ ID Nos. 9, 10 and 33-35.

DNA encoding β2

DNA encoding the β2 splice variants has been obtained.
These splice variants include β2C-β2E. Splice variants β2C-β2E
include all of the sequence set forth in SEQ ID No. 26, except
for the portion at the 5' end (up to nucleotide 182), which
differs among splice variants. The sequence set forth in SEQ
ID No. 26 encodes β2D. Additional splice variants may be
isolated using the methods described herein and
oligonucleotides including all or portions of the DNA set
forth in SEQ ID No. 26 or may be prepared or obtained as
described in the Examples. The sequences of β2 splice
variants β2C and β2E are set forth in SEQ ID Nos. 37 and 38,
respectively.

DNA encoding β3

DNA encoding the β3 subunit and any splice variants
thereof may be isolated by screening a library, as described
above for the β1 subunit, using DNA probes prepared according
to SEQ ID Nos. 19 or 20 or using all or a portion of the
deposited β3 clone plasmid β1.42 (ATCC Accession No. 69048).

The E. coli host containing plasmid β1.42 that includes
DNA encoding a β3 subunit has been deposited as ATCC Accession
No. 69048 in the American Type Culture Collection, 12301
Parklawn Drive, Rockville, Maryland 20852 U.S.A. under the
terms of the Budapest Treaty on the International Recognition
of Deposits of Microorganisms for Purposes of Patent Procedure
and the Regulations promulgated under this Treaty. Samples of
the deposited material are and will be available to industrial
property offices and other persons legally entitled to receive
them under the terms of the Treaty and Regulations and
otherwise in compliance with the patent laws and regulations
of the United States of America and all other nations or
international organizations in which this application, or an
application claiming priority of this application, is filed or
in which any patent granted on any such application is
granted.

The β3-encoding plasmid is designated β1.42. The plasmid
contains a 2.5 kb EcoRI fragment encoding β3 inserted into
vector pGEM-7Zf(+) and has been deposited in E. coli host
strain DH5α. The sequences of β3 splice variants, designated
β3-1 and β3-2, are set forth in SEQ ID Nos. 19 and 20,
respectively.

Identification and isolation of DNA encoding the α2
human calcium channel subunit

DNA encoding a human neuronal calcium channel α2 subunit
was isolated in a manner substantially similar to that used
for isolating DNA encoding an α1 subunit, except that a human
genomic DNA library was probed under low and high stringency
conditions with a fragment of DNA encoding the rabbit skeletal
muscle calcium channel α2 subunit. The fragment included
nucleotides having a sequence corresponding to the nucleotide
sequence between nucleotides 43 and 272 inclusive of rabbit
back skeletal muscle calcium channel α2 subunit cDNA as
disclosed in PCT International Patent Application Publication
No. WO 89/09834, which corresponds to U.S. Application Serial
No. 07/620,520 (now allowed U.S. Application Serial No.
07/914,231), which is a continuation-in-part of United States
Serial No. 176,899, filed April 4, 1988.

Example IV describes the isolation of DNA clones encoding
α2 subunits of a human calcium channel from a human DNA
library using genomic DNA and cDNA clones, identified by
hybridization to the genomic DNA, as probes.

SEQ ID Nos. 11 and 29-32 show the sequence of DNA
encoding α2 subunits. As described in Example V, nucleic acid
amplification analysis of RNA from human skeletal muscle,
brain tissue and aorta using oligonucleotide primers specific
for a region of the human neuronal α2 subunit cDNA that
diverges from the rabbit skeletal muscle calcium channel α2
subunit cDNA identified splice variants of the human calcium
channel α2 subunit transcript.

Identification and isolation of DNA encoding γ
human calcium channel subunits

DNA encoding a portion of a human neuronal calcium
channel γ subunit has been isolated as described in detail in
Example VI. SEQ ID No. 14 shows the nucleotide sequence at
the 3'-end of this DNA, which includes a reading frame encoding
a sequence of 43 amino acid residues. Since the portion that
has been obtained is homologous to the rabbit clone, described
in allowed co-owned U.S. Application Serial No. 07/482,384,
the remainder of the clone can be obtained using routine
methods.

Antibodies

Antibodies, monoclonal or polyclonal, specific for 
calcium channel subunit subtypes or for calcium channel types 
can be prepared employing standard techniques, known to those 
of skill in the art, using the subunit proteins or portions 
thereof as antigens. Anti-peptide and anti-fusion protein 
antibodies can be used [see, for example, Bahouth et al.
(1991) Trends Pharmacol. Sci. 12:338-343; Current Protocols in
Molecular Biology (Ausubel et al., eds.) John Wiley and Sons,
New York (1984)]. Factors to consider in selecting portions
of the calcium channel subunits for use as immunogens (as
either a synthetic peptide or a recombinantly produced
bacterial fusion protein) include antigenicity, accessibility
(i.e., extracellular and cytoplasmic domains), uniqueness to
the particular subunit, and other factors known to those of 
skill in this art. 

The availability of subunit-specific antibodies makes
possible the application of the technique of
immunohistochemistry to monitor the distribution and
expression density of various subunits (e.g., in normal vs.
diseased brain tissue). Such antibodies could also be
employed in diagnostic applications, such as LES diagnosis, and
therapeutic applications, such as using antibodies that modulate
activities of calcium channels.

The antibodies can be administered to a subject employing 
standard methods, such as, for example, by intraperitoneal, 
intramuscular, intravenous, or subcutaneous injection, implant 
or transdermal modes of administration, and the like. One of 
skill in the art can empirically determine dose forms, 
treatment regimens, etc., depending on the mode of
administration employed. 

Subunit-specific monoclonal antibodies and polyclonal
antisera have been prepared. The regions from which the
antigens were derived were identified by comparing the DNA and
amino acid sequences of all known α1 or β subunit subtypes.
Regions of least homology, preferably human-derived sequences,
were selected. The selected regions or fusion proteins containing
the selected regions are used as immunogens. Hydrophobicity
analyses of residues in selected protein regions and fusion
proteins are also performed; regions of high hydrophobicity
are avoided. Also, and more importantly, when preparing
fusion proteins in bacterial hosts, rare codons are avoided.
In particular, inclusion of 3 or more successive rare codons
in a selected host is avoided. Numerous antibodies,
polyclonal and monoclonal, specific for α1 or β subunit types
or subtypes have been prepared. Exemplary antibodies and the
peptide antigens used to prepare them are set forth in the
following Table:



TABLE 3

SPECIFICITY    AMINO ACID NUMBER    ANTIGEN NAME             ANTIBODY TYPE

α1 generic     112-140              peptide 1A#1             polyclonal
α1 generic     1420-1447            peptide 1A#2             polyclonal
α1A generic    1048-1208            α1A#2(b) GST fusion*     polyclonal, monoclonal
α1B generic    983-1106             α1B#2(b) GST fusion      polyclonal, monoclonal
α1B-1          2164-2339            α1B-1#3 GST fusion       polyclonal
α1B-2          2164-2237            α1B-2#4 GST fusion       polyclonal
α1E generic    985-1004 (α1E-3)     α1E#2(a) GST fusion      polyclonal

* GST gene fusion system; see Smith et al. (1988) Gene 67:31.
The system provides pGEX plasmids that are
designed for inducible, high-level expression of genes or gene
fragments as fusions with Schistosoma japonicum GST. Upon expression
in a bacterial host, the resulting fusion proteins are purified from
bacterial lysates by affinity chromatography.

The GST fusion proteins are each specific for the
cytoplasmic loop region IIS6-IIIS1, which is a region of low
subtype homology for all subtypes, including α1C and α1D, for
which similar fusions and antisera can be prepared.
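
The rare-codon criterion stated above for bacterial fusion proteins (avoiding 3 or more successive rare codons) lends itself to a simple check. The following Python sketch scans an in-frame coding sequence for such runs. The set of rare codons shown is an illustrative assumption for an E. coli host and is not taken from the text; the function name is likewise hypothetical. The appropriate set depends on the host selected.

```python
# Illustrative only: codons often described as rarely used in E. coli.
# This set is an assumption; substitute a table appropriate to the chosen host.
RARE_CODONS_EXAMPLE = frozenset({"AGG", "AGA", "CGA", "CTA", "ATA", "CCC"})


def rare_codon_runs(coding_seq, rare=RARE_CODONS_EXAMPLE, min_run=3):
    """Return (codon_index, run_length) for each run of >= min_run rare codons.

    `coding_seq` is read in frame from its first base; trailing bases that do
    not complete a codon are ignored. Codon indices are 0-based.
    """
    seq = coding_seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    runs = []
    run_start = None
    for idx, codon in enumerate(codons + [None]):  # None closes a trailing run
        if codon in rare:
            if run_start is None:
                run_start = idx
        else:
            if run_start is not None and idx - run_start >= min_run:
                runs.append((run_start, idx - run_start))
            run_start = None
    return runs
```

A candidate antigen region for which this function returns an empty list contains no run of three or more codons from the supplied set; regions with reported runs would be candidates for exclusion or for choosing a different fusion boundary.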

Preparation of recombinant eukaryotic cells containing DNA
encoding heterologous calcium channel subunits

DNA encoding one or more of the calcium channel subunits 
or a portion of a calcium channel subunit may be introduced 
into a host cell for expression or replication of the DNA. 
Such DNA may be introduced using methods described in the 
following examples or using other procedures well known to 
those skilled in the art. Incorporation of cloned DNA into a 
suitable expression vector, transfection of eukaryotic cells 
with a plasmid vector or a combination of plasmid vectors, 
each encoding one or more distinct genes or with linear DNA, 
and selection of transfected cells are also well known in the 
art [see, e.g., Sambrook et al. (1989) Molecular Cloning: A
Laboratory Manual, Second Edition, Cold Spring Harbor
Laboratory Press]. Cloned full-length DNA encoding any of
the subunits of a human calcium channel may be introduced into 
a plasmid vector for expression in a eukaryotic cell. Such 
DNA may be genomic DNA or cDNA. Host cells may be transfected 
with one or a combination of the plasmids, each of which 
encodes at least one calcium channel subunit. Alternatively, 
host cells may be transfected with linear DNA using methods 
well known to those of skill in the art. 

While the DNA provided herein may be expressed in any
eukaryotic cell, including yeast cells such as P. pastoris
[see, e.g., Cregg et al. (1987) Bio/Technology 5:479],
mammalian expression systems for expression of the DNA
encoding the human calcium channel subunits provided herein
are preferred.

The heterologous DNA may be introduced by any method
known to those of skill in the art, such as transfection with
a vector encoding the heterologous DNA. Particularly preferred
vectors for transfection of mammalian cells are the pSV2dhfr
expression vectors, which contain the SV40 early promoter,
mouse dhfr gene, SV40 polyadenylation and splice sites and
sequences necessary for maintaining the vector in bacteria;
cytomegalovirus (CMV) promoter-based vectors, such as pCDNA1
or pcDNA-amp; and MMTV promoter-based vectors. DNA encoding
the human calcium channel subunits has been inserted in the
vector pCDNA1 at a position immediately following the CMV
promoter. The vector pCDNA1 is presently preferred.

Stably or transiently transfected mammalian cells may be
prepared by methods known in the art by transfecting cells
with an expression vector having a selectable marker gene such
as the gene for thymidine kinase, dihydrofolate reductase,
neomycin resistance or the like, and, for stable
transfection, growing the transfected cells under conditions
selective for cells expressing the marker gene. Functional
voltage-dependent calcium channels have been produced in HEK
293 cells transfected with a derivative of the vector pCDNA1
that contains DNA encoding a human calcium channel subunit.

The heterologous DNA may be maintained in the cell as an 
episomal element or may be integrated into chromosomal DNA of 
the cell. The resulting recombinant cells may then be 
cultured or subcultured (or passaged, in the case of mammalian 
cells) from such a culture or a subculture thereof. Methods 
for transfection, injection and culturing recombinant cells 
are known to the skilled artisan. Eukaryotic cells into which
DNA or RNA may be introduced include any cells that are
transfectable by such DNA or RNA or into which such DNA may be
injected. Virtually any eukaryotic cell can serve as a
vehicle for heterologous DNA. Preferred cells are those that
can also express the DNA and RNA and most preferred cells are 
those that can form recombinant or heterologous calcium 
channels that include one or more subunits encoded by the 
heterologous DNA. Such cells may be identified empirically or 
selected from among those known to be readily transfected or 
injected. Preferred cells for introducing DNA include those 
that can be transiently or stably transfected and include, but 
are not limited to, cells of mammalian origin, such as COS 
cells, mouse L cells, CHO cells, human embryonic kidney cells, 
African green monkey cells and other such cells known to those 
of skill in the art, amphibian cells, such as Xenopus laevis 
oocytes, or those of yeast such as Saccharomyces cerevisiae or 
Pichia pastoris. Preferred cells for expressing injected RNA 
transcripts or cDNA include Xenopus laevis oocytes. Cells 
that are preferred for transfection of DNA are those that can 
be readily and efficiently transfected. Such cells are known 
to those of skill in the art or may be empirically identified. 
Preferred cells include DG44 cells and HEK 293 cells, 
particularly HEK 293 cells that can be frozen in liquid 
nitrogen and then thawed and regrown. Such HEK 293 cells are 
described, for example, in U.S. Patent No. 5,024,939 to Gorman
[see, also Stillman et al. (1985) Mol. Cell. Biol. 5:2051-2060].

The cells may be used as vehicles for replicating 
heterologous DNA introduced therein or for expressing the 
heterologous DNA introduced therein. In certain embodiments, 
the cells are used as vehicles for expressing the heterologous 
DNA as a means to produce substantially pure human calcium 
channel subunits or heterologous calcium channels. Host cells 
containing the heterologous DNA may be cultured under 
conditions whereby the calcium channels are expressed. The 
calcium channel subunits may be purified using protein 
purification methods known to those of skill in the art. For
example, antibodies, such as those provided herein, that
specifically bind to one or more of the subunits may be used
for affinity purification of the subunit or calcium channels
containing the subunits.

Substantially pure subunits of a human calcium channel,
including α1 subunits, α2 subunits, β subunits and γ
subunits of a human calcium channel, are provided.
Substantially pure isolated calcium channels that contain at 
least one of the human calcium channel subunits are also 
provided. Substantially pure calcium channels that contain a 
mixture of one or more subunits encoded by the host cell and 
one or more subunits encoded by heterologous DNA or RNA that 
has been introduced into the cell are also provided. 
Substantially pure subtype- or tissue-type specific calcium 
channels are also provided. 

In other embodiments, eukaryotic cells that contain
heterologous DNA encoding at least one of an α1 subunit of a
human calcium channel, an α2 subunit of a human calcium
channel, a β subunit of a human calcium channel and a γ
subunit of a human calcium channel are provided. In
accordance with one preferred embodiment, the heterologous DNA
is expressed in the eukaryotic cell and preferably encodes a
human calcium channel subunit.

Expression of heterologous calcium channels: 
electrophysiology and pharmacology 

Electrophysiological methods for measuring calcium 
channel activity are known to those of skill in the art and
are exemplified herein. Any such methods may be used in order 
to detect the formation of functional calcium channels and to 
characterize the kinetics and other characteristics of the 
resulting currents. Pharmacological studies may be combined 
with the electrophysiological measurements in order to further 
characterize the calcium channels. 

With respect to measurement of the activity of functional
heterologous calcium channels, preferably, endogenous ion
channel activity and, if desired, heterologous channel
activity of channels that do not contain the desired subunits,
of a host cell can be inhibited to a significant extent by
chemical, pharmacological and electrophysiological means,
including the use of differential holding potential, to
increase the signal-to-noise (S/N) ratio of the measured
heterologous calcium channel activity.

Thus, various combinations of subunits encoded by the DNA
provided herein are introduced into eukaryotic cells. The
resulting cells can be examined to ascertain whether
functional channels are expressed and to determine the
properties of the channels. In particularly preferred
aspects, the eukaryotic cell which contains the heterologous
DNA expresses it and forms a recombinant functional calcium
channel. In more preferred aspects, the recombinant
calcium channel activity is readily detectable because it is
of a type that is absent from the untransfected host cell or is
of a magnitude, or exhibits pharmacological or
biophysical properties, not exhibited in the untransfected
cell.

The eukaryotic cells can be transfected with various 
combinations of the subunit subtypes provided herein. The 
resulting cells will provide a uniform population of calcium 
channels for study of calcium channel activity and for use in 
the drug screening assays provided herein. Experiments that 
have been performed have demonstrated the inadequacy of prior 
classification schemes.

Preferred among transfected cells is a recombinant
eukaryotic cell with a functional heterologous calcium
channel. The recombinant cell can be produced by introduction
of and expression of heterologous DNA or RNA transcripts
encoding an α1 subunit of a human calcium channel, more
preferably also expressing a heterologous DNA encoding a β
subunit of a human calcium channel and/or heterologous DNA
encoding an α2 subunit of a human calcium channel. Especially
preferred is the expression in such a recombinant cell of each
of the α1, β and α2 subunits encoded by such heterologous DNA
or RNA transcripts, and optionally expression of heterologous
DNA or an RNA transcript encoding a γ subunit of a human
calcium channel. The functional calcium channels may
preferably include at least an α1 subunit and a β subunit of
a human calcium channel. Eukaryotic cells expressing these
two subunits, and also cells expressing additional subunits,
have been prepared by transfection of DNA and by injection of
RNA transcripts. Such cells have exhibited voltage-dependent
calcium channel activity attributable to calcium channels that
contain one or more of the heterologous human calcium channel
subunits. For example, eukaryotic cells expressing
heterologous calcium channels containing an α2 subunit in
addition to the α1 subunit and a β subunit have been shown to
exhibit increased calcium selective ion flow across the
cellular membrane in response to depolarization, indicating
that the α2 subunit may potentiate calcium channel function.
Cells have been co-transfected with increasing ratios of
α2 to α1, and the activity of the resulting calcium channels has
been measured. The results indicate that increasing the
amount of α2-encoding DNA relative to the other transfected
subunits increases calcium channel activity.

Eukaryotic cells which express heterologous calcium
channels containing at least a human α1 subunit, a human β
subunit and a human α2 subunit are preferred. Eukaryotic
cells transformed with a composition containing cDNA or an RNA
transcript that encodes an α1 subunit alone or in combination
with a β and/or an α2 subunit may be used to produce cells
that express functional calcium channels. Since recombinant
cells expressing human calcium channels containing all of the
human subunits encoded by the heterologous cDNA or RNA are
especially preferred, it is desirable to inject or transfect
such host cells with a sufficient concentration of the
subunit-encoding nucleic acids to form calcium channels that
contain the human subunits encoded by heterologous DNA or RNA.
The precise amounts and ratios of DNA or RNA encoding the 
subunits may be empirically determined and optimized for a
particular combination of subunits, cells and assay
conditions.

In particular, mammalian cells have been transiently and
stably transfected with DNA encoding one or more human calcium
channel subunits. Such cells express heterologous calcium
channels that exhibit pharmacological and electrophysiological
properties that can be ascribed to human calcium channels.
Such cells, however, represent homogeneous populations and the
pharmacological and electrophysiological data provide
insights into human calcium channel activity heretofore
unattainable. For example, HEK cells have been
transiently transfected with DNA encoding the α1E-1, α2b, and β1-3
subunits. The resulting cells transiently express these
subunits, which form calcium channels that appear to be a
pharmacologically distinct class of voltage-activated
calcium channels, distinct from L-, N-, T- and P-type
channels. The observed α1E currents were
insensitive to drugs and toxins previously used to define
other classes of voltage-activated calcium channels.

HEK cells that have been transiently transfected with
DNA encoding α1B-1, α2b, and β1-2 express heterologous calcium
channels that exhibit sensitivity to ω-conotoxin and currents
typical of N-type channels. It has been found that alteration
of the molar ratios of α1B-1, α2b and β1-2 introduced into the
cells to achieve equivalent mRNA levels significantly
increased the number of receptors per cell and the current
density, and affected the Kd for ω-conotoxin.
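
Because expression plasmids differ in length, a molar ratio of subunit-encoding DNA does not correspond to the same ratio by mass. The sketch below, offered only as an illustration, converts a desired molar ratio into the mass of each plasmid for a fixed total amount of DNA. It relies on the approximation that the molar mass of double-stranded DNA is proportional to its length in base pairs, so the per-base-pair constant cancels. The plasmid names and lengths in the usage note are hypothetical and are not taken from the text.

```python
def masses_for_molar_ratio(lengths_bp, molar_ratio, total_ug):
    """Split `total_ug` of DNA among plasmids to achieve a target molar ratio.

    lengths_bp  -- dict mapping plasmid name to its length in base pairs
    molar_ratio -- dict mapping plasmid name to its relative molar amount
    total_ug    -- total DNA mass to distribute, in micrograms

    The mass of each plasmid is proportional to (molar amount x length),
    since molar mass of double-stranded DNA scales with length.
    """
    weights = {name: molar_ratio[name] * lengths_bp[name] for name in lengths_bp}
    total_weight = sum(weights.values())
    return {name: total_ug * w / total_weight for name, w in weights.items()}
```

For instance, with hypothetical plasmids of 10,000, 5,000 and 5,000 base pairs at an equimolar 1:1:1 ratio and 20 micrograms total, the function assigns 10, 5 and 5 micrograms, respectively. The molar ratio that actually yields equivalent mRNA levels would still have to be determined empirically, as the text indicates.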

The electrophysiological properties of these channels
produced from α1B-1, α2b, and β1-2 were compared with those of
channels produced by transiently transfecting HEK cells with
DNA encoding α2b and β1-3. The channels exhibited similar
voltage dependence of activation, substantially identical
voltage dependence, similar kinetics of activation and tail
currents that could be fit by a single exponential. The
voltage dependence of the kinetics of inactivation was
significantly different at all voltages examined.

In certain embodiments, the eukaryotic cell with a
heterologous calcium channel is produced by introducing into
the cell a first composition, which contains at least one RNA
transcript that is translated in the cell into a subunit of a
human calcium channel. In preferred embodiments, the subunits
that are translated include an α1 subunit of a human calcium
channel. More preferably, the composition that is introduced
contains an RNA transcript which encodes an α1 subunit of a
human calcium channel and also contains (1) an RNA transcript
which encodes a β subunit of a human calcium channel and/or
(2) an RNA transcript which encodes an α2 subunit of a human
calcium channel. Especially preferred is the introduction of
RNA encoding an α1, a β and an α2 human calcium channel
subunit, and, optionally, a γ subunit of a human calcium
channel. Methods for in vitro transcription of a cloned
DNA and injection of the resulting RNA into eukaryotic cells
are well known in the art. Transcripts of any of the
full-length DNA encoding any of the subunits of a human calcium
channel may be injected alone or in combination with other
transcripts into eukaryotic cells for expression in the cells.
Amphibian oocytes are particularly preferred for expression of
in vitro transcripts of the human calcium channel subunit cDNA
clones provided herein. Amphibian oocytes that express
functional heterologous calcium channels have been produced by
this method.

Assays and Clinical uses of the cells and calcium channels 
Assays 

Assays for identifying compounds that 
modulate calcium channel activity 

Among the uses for eukaryotic cells which recombinantly
express one or more subunits are assays for determining
whether a test compound has calcium channel agonist or
antagonist activity. These eukaryotic cells may also be used
to select from among known calcium channel agonists and
antagonists those exhibiting a particular calcium channel
subtype specificity and to thereby select compounds that have
potential as disease- or tissue-specific therapeutic agents.

In vitro methods for identifying compounds, such as 
calcium channel agonists and antagonists, that modulate the
activity of calcium channels using eukaryotic cells that 
express heterologous human calcium channels are provided. 

In particular, the assays use eukaryotic cells that 
express heterologous human calcium channel subunits encoded by 
heterologous DNA provided herein, for screening potential 
calcium channel agonists and antagonists which are specific 
for human calcium channels and particularly for screening for 
compounds that are specific for particular human calcium 
channel subtypes. Such assays may be used in conjunction with 
methods of rational drug design to select among agonists and 
antagonists, which differ slightly in structure, those 
particularly useful for modulating the activity of human 
calcium channels, and to design or select compounds that 
exhibit subtype- or tissue-specific calcium channel
antagonist and agonist activities. These assays should
accurately predict the relative therapeutic efficacy of a
compound for the treatment of certain disorders in humans. In
addition, since subtype- and tissue-specific calcium channel
subunits are provided, cells with tissue-specific or
subtype-specific recombinant calcium channels may be prepared and used
in assays for identification of human calcium channel tissue-
or subtype-specific drugs.

Desirably, the host cell for the expression of calcium 
channel subunits does not produce endogenous calcium channel 
subunits of the type or in an amount that substantially 
interferes with the detection of heterologous calcium channel 
subunits in ligand binding assays or detection of heterologous 
calcium channel function, such as generation of calcium 
current, in functional assays. Also, the host cells
preferably should not produce endogenous calcium channels
which detectably interact with compounds having, at
physiological concentrations (generally nanomolar or picomolar
concentrations), affinity for calcium channels that contain
one or all of the human calcium channel subunits provided 
herein. 

With respect to ligand binding assays for identifying a 
compound which has affinity for calcium channels, cells are 
employed which express, preferably, at least a heterologous α1
subunit. Transfected eukaryotic cells which express at least
an α1 subunit may be used to determine the ability of a test
compound to specifically bind to heterologous calcium channels 
by, for example, evaluating the ability of the test compound 
to inhibit the interaction of a labeled compound known to 
specifically interact with calcium channels. Such ligand 
binding assays may be performed on intact transfected cells or 
membranes prepared therefrom. 

The capacity of a test compound to bind to or otherwise 
interact with membranes that contain heterologous calcium 
channels or subunits thereof may be determined by using any 
appropriate method, such as competitive binding analysis, for
example using Scatchard plots, in which the binding capacity of
such membranes is determined in the presence and absence of one
or more concentrations of a compound having known affinity for
the calcium channel. Where necessary, the results may be
compared to a control experiment designed in accordance with
methods known to those of skill in the art. For example, as
a negative control, the results may be compared to those of
assays of an identically treated membrane preparation from
host cells which have not been transfected with one or more
subunit-encoding nucleic acids.

The assays involve contacting the cell membrane of a 
recombinant eukaryotic cell which expresses at least one 
subunit of a human calcium channel, preferably at least an α1
subunit of a human calcium channel, with a test compound and
measuring the ability of the test compound to specifically
bind to the membrane or alter or modulate the activity of a
heterologous calcium channel on the membrane.

In preferred embodiments, the assay uses a recombinant
cell that has a calcium channel containing an α1 subunit of a
human calcium channel in combination with a β subunit of a
human calcium channel and/or an α2 subunit of a human calcium
channel. Recombinant cells expressing heterologous calcium
channels containing each of the α1, β and α2 human subunits,
and, optionally, a γ subunit of a human calcium channel are
especially preferred for use in such assays.

In certain embodiments, the assays for identifying 
compounds that modulate calcium channel activity are practiced 
by measuring the calcium channel activity of a eukaryotic cell 
having a heterologous, functional calcium channel when such 
cell is exposed to a solution containing the test compound and 
a calcium channel-selective ion and comparing the measured
calcium channel activity to the calcium channel activity of
the same cell or a substantially identical control cell in a
solution not containing the test compound. The cell is
maintained in a solution having a concentration of calcium
channel-selective ions sufficient to provide an inward current
when the channels open. Recombinant cells expressing calcium
channels that include each of the α1, β and α2 human subunits,
and, optionally, a γ subunit of a human calcium channel, are
especially preferred for use in such assays. Methods for
practicing such assays are known to those of skill in the art.
For example, for similar methods applied with Xenopus laevis
oocytes and acetylcholine receptors, see, Mishina et al.
[(1985) Nature 323:364] and, with such oocytes and sodium
channels [see, Noda et al. (1986) Nature 322:826-828]. For
similar studies which have been carried out with the
acetylcholine receptor, see, e.g., Claudio et al. [(1987)
Science 236:1688-1694].

Functional recombinant or heterologous calcium channels 
may be identified by any method known to those of skill in the 
art. For example, electrophysiological procedures for 
measuring the current across an ion-selective membrane of a 
cell, which are well known, may be used. The amount and
duration of the flow of calcium-selective ions through
heterologous calcium channels of a recombinant cell containing
DNA encoding one or more of the subunits provided herein have
been measured by electrophysiological recording using the
two-electrode and the whole-cell patch clamp techniques. In
order to improve the sensitivity of the assays, known methods
can be used to eliminate or reduce non-calcium currents and
calcium currents resulting from endogenous calcium channels,
when measuring calcium currents through recombinant channels.

For example, the DHP Bay K 8644 specifically enhances L-type 
calcium channel function by increasing the duration of the 
open state of the channels [see, e.g., Hess, J.B., et al.
(1984) Nature 311:538-544]. Prolonged opening of the channels
results in calcium currents of increased magnitude and 
duration. Tail currents can be observed upon repolarization 
of the cell membrane after activation of ion channels by a 
depolarizing voltage command. The opened channels require a 
finite time to close or "deactivate" upon repolarization, and 
the current that flows through the channels during this period 
is referred to as a tail current. Because Bay K 8644 prolongs 
opening events in calcium channels, it tends to prolong these 
tail currents and make them more pronounced. 
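
As a rough illustration of how prolonged tail currents might be quantified, the sketch below estimates a deactivation time constant from a sampled tail current. It assumes, purely for illustration, that the tail current decays as a single exponential, I(t) = I0 * exp(-t / tau); neither that model nor the numerical values are taken from the present description, and the function names are hypothetical.

```python
import math

def tail_time_constant(times_ms, currents_pa):
    """Estimate tau (ms) for a tail current assumed to follow
    I(t) = I0 * exp(-t / tau), by least-squares fit of ln|I| against t."""
    logs = [math.log(abs(i)) for i in currents_pa]
    n = len(times_ms)
    mean_t = sum(times_ms) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_ms)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_ms, logs))
    slope = sxy / sxx            # slope of ln|I| versus t equals -1 / tau
    return -1.0 / slope

def synthetic_tail(i0_pa, tau_ms, times_ms):
    """Generate an idealized, noise-free tail current (illustrative only)."""
    return [i0_pa * math.exp(-t / tau_ms) for t in times_ms]

# Hypothetical values: inward (negative) tail currents sampled every 0.5 ms.
times = [0.5 * k for k in range(1, 21)]
control = synthetic_tail(-200.0, 1.0, times)       # fast deactivation
with_agonist = synthetic_tail(-200.0, 4.0, times)  # slower deactivation

tau_control = tail_time_constant(times, control)
tau_agonist = tail_time_constant(times, with_agonist)
```

Under these assumptions, a compound that prolongs channel opening would be expected to give a larger fitted tau than the control recording.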

In practicing these assays, stably or transiently 
transfected cells or injected cells that express voltage- 
dependent human calcium channels containing one or more of the 
subunits of a human calcium channel desirably may be used in 
assays to identify agents, such as calcium channel agonists 
and antagonists, that modulate calcium channel activity. 
Functionally testing the activity of test compounds, including 
compounds having unknown activity, for calcium channel agonist 
or antagonist activity to determine if the test compound 
potentiates, inhibits or otherwise alters the flow of calcium 
ions or other ions through a human calcium channel can be 
accomplished by (a) maintaining a eukaryotic cell which is 
transfected or injected to express a heterologous functional 
calcium channel capable of regulating the flow of calcium
channel-selective ions into the cell in a medium containing
calcium channel-selective ions (i) in the presence of and (ii)
in the absence of a test compound; (b) maintaining the cell
under conditions such that the heterologous calcium channels
are substantially closed and endogenous calcium channels of
the cell are substantially inhibited; (c) depolarizing the
membrane of the cell maintained in step (b) to an extent and
for an amount of time sufficient to cause (preferably, 
substantially only) the heterologous calcium channels to 
become permeable to the calcium channel-selective ions; and 
(d) comparing the amount and duration of current flow into the 
cell in the presence of the test compound to that of the 
current flow into the cell, or a substantially similar cell, 
in the absence of the test compound. 
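
Step (d) above, the comparison of the amount and duration of current flow with and without the test compound, can be sketched as follows. This is a minimal illustration rather than a prescribed analysis: the function names, the 10% tolerance and the sample traces are all assumptions, and inward current is taken as negative by the usual sign convention.

```python
def peak_inward_current(trace_pa):
    """Most negative sample in the trace (inward current is negative)."""
    return min(trace_pa)

def charge_transfer(trace_pa, dt_ms):
    """Trapezoidal integral of current over time, in pA*ms (i.e. fC).
    This combines the amount and the duration of current flow."""
    return sum((a + b) / 2.0 * dt_ms for a, b in zip(trace_pa, trace_pa[1:]))

def classify(control_pa, treated_pa, dt_ms, tolerance=0.10):
    """Compare charge transfer with and without the test compound.
    The tolerance is an arbitrary illustrative threshold; the control
    trace must carry a non-zero net charge."""
    ratio = charge_transfer(treated_pa, dt_ms) / charge_transfer(control_pa, dt_ms)
    if ratio > 1.0 + tolerance:
        return "potentiates"
    if ratio < 1.0 - tolerance:
        return "inhibits"
    return "no clear effect"

# Hypothetical traces sampled at 1 ms intervals after the depolarizing step.
control = [0.0, -100.0, -60.0, -20.0, 0.0]
agonist_like = [0.0, -150.0, -120.0, -60.0, 0.0]
antagonist_like = [0.0, -30.0, -20.0, -5.0, 0.0]

result_agonist = classify(control, agonist_like, 1.0)
result_antagonist = classify(control, antagonist_like, 1.0)
```

Integrated charge is used here because it reflects both magnitude and duration in a single number; peak amplitude alone (as returned by `peak_inward_current`) would miss a compound that only prolongs channel opening.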

The assays thus use cells, provided herein, that express
heterologous functional calcium channels and measure
functionally, such as electrophysiologically, the ability of
a test compound to potentiate, antagonize or otherwise
modulate the magnitude and duration of the flow of calcium
channel-selective ions, such as Ca2+ or Ba2+, through the
heterologous functional channel. The amount of current which
flows through the recombinant calcium channels of a cell may
be determined directly, such as electrophysiologically, or by
monitoring an independent reaction which occurs
intracellularly and which is directly influenced in a calcium
(or other) ion dependent manner. Any method for assessing
the activity of a calcium channel may be used in conjunction
with the cells and assays provided herein. For example,
in one embodiment of the method for testing a compound for its
ability to modulate calcium channel activity, the amount of
current is measured by its modulation of a reaction which is
sensitive to calcium channel-selective ions and uses a
eukaryotic cell which expresses a heterologous calcium channel
and also contains a transcriptional control element
operatively linked for expression to a structural gene that
encodes an indicator protein. The transcriptional control
element used for transcription of the indicator gene is
responsive in the cell to a calcium channel-selective ion,
such as Ca2+ and Ba2+. The details of such transcriptional
based assays are described in commonly owned PCT International
Patent Application No. PCT/US91/5625, filed August 7, 1991,
which claims priority to copending commonly owned allowed U.S.
Application Serial No. 07/563,751, filed August 7, 1990; see
also, commonly owned published PCT International Patent
Application No. PCT/US92/11090, which corresponds to co-pending
U.S. Applications Serial Nos. 08/229,150 and 08/244,985.

Assays for diagnosis of LES

LES is an autoimmune disease characterized by an 
insufficient release of acetylcholine from motor nerve 
terminals which normally are responsive to nerve impulses. 
Immunoglobulins (IgG) from LES patients block individual 
voltage-dependent calcium channels and thus inhibit calcium
channel activity [Kim and Neher, Science 235:405-408 (1988)].
A diagnostic assay for Lambert Eaton Syndrome (LES) is 
provided herein. The diagnostic assay for LES relies on the 
immunological reactivity of LES IgG with the human calcium 
channels or particular subunits alone or in combination or 
expressed on the surface of recombinant cells. For example, 
such an assay may be based on immunoprecipitation of LES IgG 
by the human calcium channel subunits and cells that express 
such subunits provided herein. 

Clinical applications 

In relation to therapeutic treatment of various disease 
states, the availability of DNA encoding human calcium channel 
subunits permits identification of any alterations in such 
genes (e.g., mutations) which may correlate with the
occurrence of certain disease states. In addition, the
creation of animal models of such disease states becomes
possible, by specifically introducing such mutations into
synthetic DNA fragments that can then be introduced into
laboratory animals or in vitro assay systems to determine the
effects thereof.

Also, genetic screening can be carried out using the
nucleotide sequences as probes. Thus, nucleic acid samples 
from subjects having pathological conditions suspected of 
involving alteration/modification of any one or more of the 
calcium channel subunits can be screened with appropriate 
probes to determine if any abnormalities exist with respect to 
any of the endogenous calcium channels. Similarly, subjects 
having a family history of disease states related to calcium 
channel dysfunction can be screened to determine if they are 
also predisposed to such disease states. 

EXAMPLES 

The following examples are included for illustrative
purposes only and are not intended to limit the scope of the 
invention. 

EXAMPLE I: PREPARATION OF LIBRARIES USED FOR ISOLATION
OF DNA ENCODING HUMAN NEURONAL
VOLTAGE-DEPENDENT CALCIUM CHANNEL SUBUNITS

A. RNA Isolation 

1. IMR32 cells 

IMR32 cells were obtained from the American Type Culture 
Collection (ATCC Accession No. CCL127, Rockville, MD) and 
grown in DMEM, 10% fetal bovine serum, 1% 
penicillin/streptomycin (GIBCO, Grand Island, NY) plus 1.0 mM 
dibutyryl cAMP (dbcAMP) for ten days. Total RNA was isolated 
from the cells according to the procedure described by H.C. 
Birnboim [(1988) Nucleic Acids Research 16:1487-1497]. 
Poly(A+) RNA was selected according to standard procedures
[see, e.g., Sambrook et al. (1989) Molecular Cloning, A
Laboratory Manual, Cold Spring Harbor Laboratory Press; pg.
7.26-7.29].

2. Human thalamus tissue 

Human thalamus tissue (2.34 g), obtained from the
National Neurological Research Bank, Los Angeles, CA, that had
been stored frozen at -70°C was pulverized using a mortar and
pestle in the presence of liquid nitrogen and the cells were
lysed in 12 ml of lysis buffer (5 M guanidinium
isothiocyanate, 50 mM TRIS, pH 7.4, 10 mM EDTA, 5%
β-mercaptoethanol). Lysis buffer was added to the lysate to
yield a final volume of 17 ml. N-laurylsarcosine and CsCl
were added to the mixture to yield final concentrations of 4%
and 0.01 g/ml, respectively, in a final volume of 18 ml.

The sample was centrifuged at 9,000 rpm in a Sorvall SS34 
rotor for 10 min at room temperature to remove the insoluble 
material as a pellet. The supernatant was divided into two 
equal portions and each was layered onto a 2-ml cushion of a
solution of 5.7 M CsCl, 0.1 M EDTA contained in separate 
centrifuge tubes to yield approximately 9 ml per tube. The 
samples were centrifuged in an SW41 rotor at 37,000 rpm for 24 
h at 20°C. 

After centrifugation, each RNA pellet was resuspended in 
3 ml ETS (10 mM TRIS, pH 7.4, 10 mM EDTA, 0.2% SDS) and
combined into a single tube. The RNA was precipitated with
0.25 M NaCl and two volumes of 95% ethanol.

The precipitate was collected by centrifugation and
resuspended in 4 ml PK buffer (0.05 M TRIS, pH 8.4, 0.14 M
NaCl, 0.01 M EDTA, 1% SDS). Proteinase K was added to the
sample to a final concentration of 200 μg/ml. The sample was
incubated at 22°C for 1 h, followed by extraction with an
equal volume of phenol:chloroform:isoamyl alcohol (50:48:2) two
times, followed by one extraction with an equal volume of
chloroform:isoamyl alcohol (24:1). The RNA was precipitated
with ethanol and NaCl. The precipitate was resuspended in 400
μl of ETS buffer. The yield of total RNA was approximately
1.0 mg. Poly(A+) RNA (30 μg) was isolated from the total RNA
according to standard methods as stated in Example I.A.1.

B. Library Construction

Double-stranded cDNA was synthesized according to
standard methods [see, e.g., Sambrook et al. (1989) IN:
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor
Laboratory Press, Chapter 8]. Each library was prepared in
substantially the same manner except for differences in:
1) the oligonucleotide used to prime the first strand cDNA
synthesis, 2) the adapters that were attached to the
double-stranded cDNA, 3) the method used to remove the free or
unused adapters, and 4) the size of the fractionated cDNA
ligated into the λ phage vector.

1. IMR32 cDNA library #1

Single-stranded cDNA was synthesized using IMR32 poly(A+)
RNA (Example I.A.1.) as a template and was primed using
oligo(dT)12-18 (Collaborative Research Inc., Bedford, MA). The
single-stranded cDNA was converted to double-stranded cDNA and
the yield was approximately 2 μg. EcoRI adapters:

5'-AATTCGGTACGTACACTCGAGC-3' = 22-mer (SEQ ID No. 15)
3'-    GCCATGCATGTGAGCTCG-5' = 18-mer (SEQ ID No. 16)

also containing SnaBI and XhoI restriction sites were then
added to the double-stranded cDNA according to the following
procedure.
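
The design of the adapter pair can be checked with a few lines of code. The sketch below is illustrative only: it confirms that the 18-mer is complementary to the 3' portion of the 22-mer, that the unpaired 5' end of the 22-mer is the AATT overhang compatible with an EcoRI-cut vector, and that the SnaBI (TACGTA) and XhoI (CTCGAG) recognition sequences are present. The variable and function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

top_22mer = "AATTCGGTACGTACACTCGAGC"        # SEQ ID No. 15, written 5'->3'
bottom_18mer_3to5 = "GCCATGCATGTGAGCTCG"    # SEQ ID No. 16, written 3'->5' as in the text
bottom_18mer = bottom_18mer_3to5[::-1]      # same strand rewritten 5'->3'

# Top-strand bases that pair with the 18-mer in the annealed adapter.
duplex_region = reverse_complement(bottom_18mer)

# Bases of the 22-mer left single-stranded at its 5' end.
overhang = top_22mer[: len(top_22mer) - len(bottom_18mer)]

# Recognition sequences of the enzymes named in the text.
sites = {"SnaBI": "TACGTA", "XhoI": "CTCGAG"}
found = {name: site in top_22mer for name, site in sites.items()}
```

The same check could be applied to the 20-mer and 24-mer adapters used for the human thalamus library.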

a. Phosphorylation of 18-mer 

The 18-mer was phosphorylated using standard methods
[see, e.g., Sambrook et al. (1989) IN: Molecular Cloning, A
Laboratory Manual, Cold Spring Harbor Laboratory Press,
Chapter 8] by combining in a 10 μl total volume the 18-mer
(225 pmoles) with [32P]γ-ATP (7000 Ci/mmole; 1.0 μl) and kinase
(2 U) and incubating at 37°C for 15 minutes. After
incubation, 1 μl 10 mM ATP and an additional 2 U of kinase
were added and incubated at 37°C for 15 minutes.
Kinase was then inactivated by boiling for 10 minutes.

b. Hybridization of 22-mer

The 22-mer was hybridized to the phosphorylated 18-mer
by addition of 225 pmoles of the 22-mer (plus water to bring
volume to 15 μl), and incubation at 65°C for 5 minutes. The
reaction was then allowed to slow cool to room temperature.

The adapters were thus present at a concentration of 15
pmoles/μl, and were ready for cDNA-adapter ligation.
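
The stated adapter concentration follows directly from the amounts above. As a small worked check, assuming that the two oligonucleotides anneal completely so that the amount of duplex equals the amount of the limiting strand (the function name is hypothetical):

```python
def duplex_concentration(pmol_strand_a, pmol_strand_b, volume_ul):
    """Concentration (pmol/ul) of annealed duplex, assuming complete
    annealing so that the limiting strand sets the amount of duplex."""
    return min(pmol_strand_a, pmol_strand_b) / volume_ul

# 225 pmoles of 18-mer and 225 pmoles of 22-mer in a 15 ul final volume.
adapter_pmol_per_ul = duplex_concentration(225, 225, 15)
```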

c. Ligation of adapters to cDNA 

After the EcoRI, SnaBI, XhoI adapters were ligated to the
double-stranded cDNA using a standard protocol [see, e.g.,
Sambrook et al. (1989) IN: Molecular Cloning, A Laboratory
Manual, Cold Spring Harbor Laboratory Press, Chapter 8], the
ligase was inactivated by heating the mixture to 72°C for 15
minutes. The following reagents were added to the cDNA
ligation reaction and heated at 37°C for 30 minutes: cDNA
ligation reaction (20 μl), water (24 μl), 10x kinase buffer (3
μl), 10 mM ATP (1 μl) and kinase (2 μl of 2 U/μl). The
reaction was stopped by the addition of 2 μl 0.5 M EDTA,
followed by one phenol/chloroform extraction and one
chloroform extraction.

d. Size Selection and Packaging of cDNA 

The double-stranded cDNA with the EcoRI, SnaBI, XhoI
adapters ligated was purified away from the free or unligated
adapters using a 5 ml Sepharose CL-4B column (Sigma, St.
Louis, MO). 100 μl fractions were collected and those
containing the cDNA, determined by monitoring the
radioactivity, were pooled, ethanol precipitated, resuspended
in TE buffer and loaded onto a 1% agarose gel. After the
electrophoresis, the gel was stained with ethidium bromide and
the 1 to 3 kb fraction was cut from the gel. The cDNA
embedded in the agarose was eluted using the "Geneluter
Electroelution System" (Invitrogen, San Diego, CA). The
eluted cDNA was collected by ethanol precipitation and
resuspended in TE buffer at 0.10 pmol/μl. The cDNA was
ligated to 1 μg of EcoRI digested, dephosphorylated λgt11 in
a 5 μl reaction volume at a 2- to 4-fold molar excess ratio
of cDNA over the λgt11 vector. The ligated λgt11 containing
the cDNA insert was packaged into λ phage virions in vitro
using the Gigapack (Stratagene, La Jolla, CA) kit. The
packaged phage were plated on an E. coli Y1088 bacterial lawn
in preparation for screening.

2. IMR32 cDNA library #2 

This library was prepared as described (Example I.B.1.)
with the exception that 3 to 9 kb cDNA fragments were ligated
into the λgt11 phage vector rather than the 1 to 3 kb
fragments.

3. IMR32 cDNA library #3

IMR32 cell poly(A+) RNA (Example I.A.1.) was used as a
template to synthesize single-stranded cDNA. The primers for
the first strand cDNA synthesis were random primers
(hexadeoxy-nucleotides [pd(N)6] Cat #5020-1, Clontech, Palo
Alto, CA). The double-stranded cDNA was synthesized, EcoRI,
SnaBI, XhoI adapters were added to the cDNA, the unligated
adapters were removed, and the double-stranded cDNA with the
ligated adapters was fractionated on an agarose gel, as
described in Example I.B.1. The cDNA fraction greater than
1.8 kb was eluted from the agarose, ligated into λgt11,
packaged, and plated into a bacterial lawn of Y1088 (as
described in Example I.B.1.).

4. IMR32 cDNA library #4 

IMR32 cell poly(A+) RNA (Example I.A.1.) was used as a
template to synthesize single-stranded cDNA. The primers for
the first strand cDNA synthesis were oligonucleotides: 89-365a
specific for the α1D (VDCC III) type α1-subunit (see Example
II.A.) coding sequence (the complementary sequence of nt 2927
to 2956, SEQ ID No. 1), 89-495 specific for the α1C (VDCC II)
type α1-subunit (see Example II.B.) coding sequence (the
complementary sequence of nt 852 to 873, SEQ ID No. 3), and
90-12 specific for the α1C-subunit coding sequence (the
complementary sequence of nt 2496 to 2520, SEQ ID No. 3). The
cDNA library was then constructed as described (Example
I.B.3.), except that the cDNA size-fraction greater than 1.5 kb
was eluted from the agarose rather than the greater than
1.8 kb fraction.

5. IMR32 cDNA library #5 

The cDNA library was constructed as described (Example 
I.B.3.) with the exception that the size-fraction greater than 
1.2 kb was eluted from the agarose rather than the greater 
than 1.8 kb fraction. 

6. Human thalamus cDNA library #6 

Human thalamus poly(A+) RNA (Example I.A.2.) was used as
a template to synthesize single-stranded cDNA. Oligo(dT) was
used to prime the first strand synthesis (Example I.B.1.).
The double-stranded cDNA was synthesized (Example I.B.1.) and
EcoRI, KpnI, NcoI adapters of the following sequence:

5' CCATGGTACCTTCGTTGACG 3'     = 20-mer (SEQ ID NO. 17)
3' GGTACCATGGAAGCAACTGCTTAA 5' = 24-mer (SEQ ID NO. 18)

were ligated to the double-stranded cDNA as described (Example
I.B.1.) with the 20-mer replacing the 18-mer and the 24-mer
replacing the 22-mer. The unligated adapters were removed by
passing the cDNA-adapter mixture through a 1 ml Bio Gel A-50
(Bio-Rad Laboratories, Richmond, CA) column. Fractions (30
μl) were collected and 1 μl of each fraction in the first peak
of radioactivity was electrophoresed on a 1% agarose gel.
After electrophoresis, the gel was dried on a vacuum gel drier
and exposed to x-ray film. The fractions containing cDNA
fragments greater than 600 bp were pooled, ethanol
precipitated, and ligated into λgt11 (Example I.B.1.). The
construction of the cDNA library was completed as described
(Example I.B.1.).

C. Hybridization and Washing Conditions

Hybridization of radiolabeled nucleic acids to
immobilized DNA for the purpose of screening cDNA libraries,
DNA Southern transfers, or northern transfers was routinely
performed in standard hybridization conditions
[hybridization: 50% deionized formamide, 200 μg/ml sonicated
herring sperm DNA (Cat #223646, Boehringer Mannheim
Biochemicals, Indianapolis, IN), 5 x SSPE, 5 x Denhardt's,
42°C; wash: 0.2 x SSPE, 0.1% SDS, 65°C]. The recipes for SSPE
and Denhardt's and the preparation of deionized formamide are
described, for example, in Sambrook et al. (1989) Molecular
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory
Press, Chapter 8. In some hybridizations, lower stringency
conditions were used in that 10% deionized formamide replaced
50% deionized formamide described for the standard
hybridization conditions.

The washing conditions for removing the non-specific
probe from the filters were either high, medium, or low
stringency as described below:

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C

3) low stringency: 1.0 x SSPE, 0.1% SDS, 50°C.

It is understood that equivalent stringencies may be achieved
using alternative buffers, salts and temperatures.
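
For record keeping, the three wash conditions can be captured as a small data structure. The sketch below is one possible encoding, not part of the described method; the class and function names are hypothetical. The comparison relies only on the general principle that lower salt concentration and higher temperature each increase stringency.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Wash:
    sspe_x: float       # SSPE concentration, as a multiple of 1 x
    sds_percent: float  # SDS concentration, percent
    temp_c: float       # wash temperature, degrees Celsius

# Wash conditions exactly as listed in the text.
WASHES = {
    "high": Wash(sspe_x=0.1, sds_percent=0.1, temp_c=65),
    "medium": Wash(sspe_x=0.2, sds_percent=0.1, temp_c=50),
    "low": Wash(sspe_x=1.0, sds_percent=0.1, temp_c=50),
}

def more_stringent(a, b):
    """True if wash a is no less stringent than b in both salt and
    temperature and differs from b in at least one of them."""
    return a.sspe_x <= b.sspe_x and a.temp_c >= b.temp_c and a != b
```

With this encoding, the high stringency wash compares as more stringent than the medium wash, and the medium wash as more stringent than the low wash.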

EXAMPLE II: ISOLATION OF DNA ENCODING THE HUMAN NEURONAL
CALCIUM CHANNEL α1 SUBUNIT

A. Isolation of DNA encoding the α1D subunit

1. Reference list of partial α1D cDNA clones

Numerous α1D-specific cDNA clones were isolated in order
to characterize the complete α1D coding sequence plus portions
of the 5' and 3' untranslated sequences. SEQ ID No. 1 shows
the complete α1D DNA coding sequence, plus 510 nucleotides of
α1D 5' untranslated sequence ending in the guanine nucleotide
adjacent to the adenine nucleotide of the proposed initiation
of translation as well as 642 nucleotides of 3' untranslated
sequence. Also shown in SEQ ID No. 1 is the deduced amino
acid sequence. A list of partial cDNA clones used to
characterize the α1D sequence and the nucleotide position of
each clone relative to the full-length α1D cDNA sequence, which
is set forth in SEQ ID No. 1, is shown below. The isolation
and characterization of these clones are described below
(Example II.A.2.).

IMR32   1.144    nt 1 to 510 of SEQ ID No. 1
                 5' untranslated sequence,
                 nt 511 to 2431, SEQ ID No. 1

IMR32*  1.136    nt 1627 to 2988, SEQ ID No. 1
                 nt 1 to 104 of SEQ ID No. 2
                 additional exon,

IMR32@  1.80     nt 2083 to 6468, SEQ ID No. 1

IMR32#  1.36     nt 2857 to 4281, SEQ ID No. 1

IMR32   1.163    nt 5200 to 7635, SEQ ID No. 1

* 5' of nt 1627, IMR32 1.136 encodes an intron and an
  additional exon described in Example II.A.2.d.

@ IMR32 1.80 contains two deletions, nt 2984 to 3131
  and nt 5303 to 5349 (SEQ ID No. 1). The 148 nt
  deletion (nt 2984 to 3131) was corrected by
  performing a polymerase chain reaction described in
  Example II.A.3.b.

# IMR32 1.36 contains a 132 nt deletion (nt 3081 to
  3212).
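
The deletion sizes quoted in the footnotes can be checked from the nucleotide positions, counting both end positions as part of the deleted span. The short sketch below is only an arithmetic check; the dictionary keys and function name are hypothetical labels.

```python
def span_length(start, end):
    """Number of nucleotides from start to end, inclusive."""
    return end - start + 1

# Deleted positions relative to SEQ ID No. 1, as given in the footnotes.
deletions = {
    "IMR32 1.80, first deletion": (2984, 3131),
    "IMR32 1.80, second deletion": (5303, 5349),
    "IMR32 1.36": (3081, 3212),
}

lengths = {name: span_length(*span) for name, span in deletions.items()}
```

The first and third values agree with the 148 nt and 132 nt figures stated in the footnotes; the length of the second IMR32 1.80 deletion is not stated in the text and is simply computed from the positions.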

2. Isolation and characterization of individual
clones listed in Example II.A.1.

a. IMR32 1.36 

Two million recombinants of the IMR32 cDNA library #1
(Example I.B.1.) were screened in duplicate at a density of
approximately 200,000 plaques per 150 mm plate using a mixture
of radiolabelled fragments of the coding region of the rabbit
skeletal muscle calcium channel α1 cDNA [for the sequence of
the rabbit skeletal muscle calcium channel α1 subunit cDNA,
see, Tanabe et al. (1987) Nature 328:313-318]:

Fragment        Nucleotides

KpnI-EcoRI      -78 to 1006

EcoRI-XhoI      1006 to 2653

ApaI-ApaI       3093 to 4182

BglII-SacI      4487 to 5310

The hybridization was performed using low stringency
hybridization conditions (Example I.C.) and the filters were
washed under low stringency (Example I.C.). Only one
α1D-specific recombinant (IMR32 1.36) of the 2 x 10^6 screened
was identified. IMR32 1.36 was plaque purified by standard
methods (J. Sambrook et al. (1989) Molecular Cloning, A
Laboratory Manual, Cold Spring Harbor Laboratory Press,
Chapter 8), subcloned into pGEM3 (Promega, Madison, WI) and
characterized by DNA sequencing.

b. IMR32 1.80 

Approximately 1 x 10^6 recombinants of the IMR32 cDNA
library #2 (Example I.B.2.) were screened in duplicate at a
density of approximately 100,000 plaques per 150 mm plate
using the IMR32 1.36 cDNA fragment (Example II.A.1.) as a
probe. Standard hybridization conditions were used, and the
filters were washed under high stringency (Example I.C.).
Three positive plaques were identified, one of which was IMR32
1.80. IMR32 1.80 was plaque purified by standard methods,
restriction mapped, subcloned, and characterized by DNA
sequencing.

c. IMR32 1.144 

Approximately 1 x 10^6 recombinants of the IMR32 cDNA
library #3 (Example I.B.3.) were screened with the EcoRI-PvuII
fragment (nt 2083 to 2518, SEQ ID No. 1) of IMR32 1.80. The
hybridization was performed using standard hybridization
conditions (Example I.C.) and the filters were washed under
high stringency (Example I.C.). Three positive plaques were
identified, one of which was IMR32 1.144. IMR32 1.144 was
plaque purified, restriction mapped, and the cDNA insert was
subcloned into pGEM7Z (Promega, Madison, WI) and characterized
by DNA sequencing. This characterization revealed that IMR32
1.144 has a series of ATG codons encoding seven possible
initiating methionines (nt 511 to 531, SEQ ID No. 1). Nucleic
acid amplification analysis, and DNA sequencing of cloned
nucleic acid amplification analysis products encoding these
seven ATG codons confirmed that this sequence is present in
the α1D transcript expressed in dbcAMP-induced IMR32 cells.

d. IMR32 1.136 

Approximately 1 x 10^6 recombinants of the IMR32 cDNA
library #4 (Example I.B.4.) were screened with the EcoRI-PvuII
fragment (nt 2083 to 2518, SEQ ID No. 1) of IMR32 1.80
(Example II.A.1.). The hybridization was performed using
standard hybridization conditions (Example I.C.) and the
filters were washed under high stringency (Example I.C.). Six
positive plaques were identified, one of which was IMR32 1.136.
IMR32 1.136 was plaque purified, restriction mapped, and the
cDNA insert was subcloned into a standard plasmid vector,
pSP72 (Promega, Madison, WI), and characterized by DNA
sequencing. This characterization revealed that IMR32 1.136
encodes an incompletely spliced α1D transcript. The clone
contains nucleotides 1627 to 2988 of SEQ ID No. 1 preceded by
an approximate 640 bp intron. This intron is then preceded by
a 104 nt exon (SEQ ID No. 2) which is an alternative exon
encoding the IS6 transmembrane domain [see, e.g., Tanabe et
al. (1987) Nature 328:313-318 for a description of the IS1 to
IVS6 transmembrane terminology] of the α1D subunit and can
replace nt 1627 to 1730, SEQ ID No. 1, to produce a completely
spliced α1D transcript.

e. IMR32 1.163

Approximately 1 x 10^6 recombinants of the IMR32 cDNA
library #3 (Example I.B.3.) were screened with the NcoI-XhoI
fragment of IMR32 1.80 (Example II.A.1.) containing nt 5811 to
6468 (SEQ ID No. 1). The hybridization was performed using
standard hybridization conditions (Example I.C.) and the
filters were washed under high stringency (Example I.C.).
Three positive plaques were identified, one of which was IMR32
1.163. IMR32 1.163 was plaque purified, restriction mapped,
and the cDNA insert was subcloned into a standard plasmid
vector, pSP72 (Promega, Madison, WI), and characterized by
DNA sequencing. This characterization revealed that IMR32
1.163 contains the α1D termination codon, nt 6994 to 6996 (SEQ
ID No. 1).

3. Construction of a full-length α1D cDNA
[pVDCCIII(A)]

α1D cDNA clones IMR32 1.144, IMR32 1.136, IMR32 1.80, and
IMR32 1.163 (Example II.A.2.) overlap and include the entire
α1D coding sequence, nt 511 to 6993 (SEQ ID No. 1), with the
exception of a 148 bp deletion, nt 2984 to 3131 (SEQ ID No.
1). Portions of these partial cDNA clones were ligated to
generate a full-length α1D cDNA in a eukaryotic expression
vector. The resulting vector was called pVDCCIII(A). The
construction of pVDCCIII(A) was performed in four steps
described in detail below: (1) the construction of
pVDCCIII/5' using portions of IMR32 1.144, IMR32 1.136, and
IMR32 1.80, (2) the construction of pVDCCIII/5'.3 that
corrects the 148 nt deletion in the IMR32 1.80 portion of
pVDCCIII/5', (3) the construction of pVDCCIII/3'.1 using
portions of IMR32 1.80 and IMR32 1.163, and (4) the ligation
of a portion of the pVDCCIII/5'.3 insert, the insert of
pVDCCIII/3'.1, and pcDNA1 (Invitrogen, San Diego, CA) to form
pVDCCIII(A). The vector pcDNA1 is a eukaryotic expression
vector containing a cytomegalovirus (CMV) promoter which is a
constitutive promoter recognized by mammalian host cell RNA
polymerase II.
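
As a consistency check on the coordinates used in this example, the coding sequence given as nt 511 to 6993 should span a whole number of codons and should be followed immediately by the termination codon reported at nt 6994 to 6996. The sketch below performs only that arithmetic; the constant names are hypothetical.

```python
# Coordinates relative to SEQ ID No. 1, as stated in the text.
CODING_START, CODING_END = 511, 6993   # alpha-1D coding sequence
STOP_CODON = (6994, 6996)              # termination codon

coding_length = CODING_END - CODING_START + 1       # inclusive span
codons, remainder = divmod(coding_length, 3)        # whole codons, leftover nt
stop_follows_coding = STOP_CODON[0] == CODING_END + 1
```

A remainder of zero indicates that the stated start and end positions are in frame with one another.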

Each of the DNA fragments used in preparing the full- 
length construct was purified by electrophoresis through an 
agarose gel onto DE81 filter paper (Whatman, Clifton, NJ) and 
elution from the filter paper using 1.0 M NaCl, 10 mM TRIS, pH 
8.0, 1 mM EDTA. The ligations typically were performed in a 
10 μl reaction volume with an equal molar ratio of insert
fragment and a two-fold molar excess of the total insert 
relative to the vector. The amount of DNA used was normally 
about 50 ng to 100 ng. 

a. pVDCCIII/5' 

To construct pVDCCIII/5', IMR32 1.144 (Example II.A.2.c.)
was digested with XhoI and EcoRI and the fragment containing
the vector (pGEM7Z), α1D nt 1 to 510 (SEQ ID No. 1), and α1D nt
511 to 1732 (SEQ ID No. 1) was isolated by gel
electrophoresis. The EcoRI-ApaI fragment of IMR32 1.136
(Example II.A.2.d.), nucleotides 1733 to 2671 (SEQ ID No. 1),
was isolated, and the ApaI-HindIII fragment of IMR32 1.80
(Example II.A.2.b.), nucleotides 2672 to 4492 (SEQ ID No. 1),
was isolated. The three DNA fragments were ligated to form
pVDCCIII/5' containing nt 1 to 510 (5' untranslated sequence;
SEQ ID No. 1) and nt 511 to 4492 (SEQ ID No. 1).
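
The fragment coordinates given for pVDCCIII/5' can be checked with a
short sketch. The Python below is illustrative only: the coordinates
come from the text, while the function and variable names are
assumptions introduced here and are not part of the described method.
It tests whether the three fragments tile nt 1 to 4492 of SEQ ID No. 1
without gaps or overlaps.

```python
# Illustrative check only. Coordinates are from the text; the names
# tiles_contiguously and fragments are hypothetical.

def tiles_contiguously(fragments):
    """Return True if 1-based inclusive (start, end) ranges abut exactly."""
    ordered = sorted(fragments)
    return all(
        prev_end + 1 == next_start
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
    )

# (start nt, end nt) in SEQ ID No. 1 for each fragment ligated into pVDCCIII/5'
fragments = [
    (1, 1732),     # XhoI-EcoRI fragment of IMR32 1.144 (nt 1-510 plus nt 511-1732)
    (1733, 2671),  # EcoRI-ApaI fragment of IMR32 1.136
    (2672, 4492),  # ApaI-HindIII fragment of IMR32 1.80
]

print(tiles_contiguously(fragments))
```

The same helper could be applied to the fragment lists of the later
construction steps, provided shared restriction-site nucleotides are
assigned to one fragment only, as they are in the coordinates above.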

b. pVDCCIII/5'.3

Comparison of the IMR32 1.36 and IMR32 1.80 DNA sequences
revealed that these two cDNA clones differ through the α1D
coding sequence, nucleotides 2984 to 3212. Nucleic acid
amplification analysis of IMR32 1.80 and dbcAMP-induced
(1.0 mM, 10 days) IMR32 cytoplasmic RNA (isolated according to
Ausubel, F.M. et al. (Eds) (1988) Current Protocols in
Molecular Biology, John Wiley and Sons, New York) revealed
that IMR32 1.80 had a 148 nt deletion, nt 2984 to 3131 (SEQ ID
No. 1), and that IMR32 1.36 had a 132 nt deletion, nt 3081 to
3212. To perform the nucleic acid amplification analysis, the
amplification reaction was primed with α1D-specific
oligonucleotides 112 (nt 2548 to 2572, SEQ ID No. 1) and 311
(the complementary sequence of nt 3928 to 3957, SEQ ID No. 1).
These products were then reamplified using α1D-specific
oligonucleotides 310 (nt 2583 to 2608, SEQ ID No. 1) and 312
(the complementary sequence of nt 3883 to 3909). This
reamplified product, which contains AccI and BglII restriction
sites, was digested with AccI and BglII and the AccI-BglII
fragment, nt 2765 to 3890 (SEQ ID No. 1), was cloned into
AccI-BglII digested pVDCCIII/5' to replace the AccI-BglII
pVDCCIII/5' fragment that had the deletion. This new
construct was named pVDCCIII/5'.3. DNA sequence determination
of pVDCCIII/5'.3 through the amplified region confirmed the
148 nt deletion in IMR32 1.80.

c. pVDCCIII/3'.1

To construct pVDCCIII/3'.1, the cDNA insert of IMR32
1.163 (Example II.A.2.e.) was subcloned into pBluescript II
(Stratagene, La Jolla, CA) as an XhoI fragment. The XhoI
sites on the cDNA fragment were furnished by the adapters used
to construct the cDNA library (Example I.B.3.). The insert
was oriented such that the translational orientation of the
insert of IMR32 1.163 was opposite to that of the lacZ gene
present in the plasmid, as confirmed by analysis of
restriction enzyme digests of the resulting plasmid. This was
done to preclude the possibility of expression of α1D sequences
in DH5α cells transformed with this plasmid due to fusion with
the lacZ gene. This plasmid was then digested with HindIII
and BglII and the HindIII-BglII fragment (the HindIII site
comes from the vector and the BglII site is at nt 6220, SEQ ID
No. 1) was eliminated, thus deleting nt 5200 to 6220 (SEQ ID
No. 1) of the IMR32 1.163 clone and removing this sequence
from the remainder of the plasmid, which contained the 3'
BglII-XhoI fragment, nt 6221 to 7635 (SEQ ID No. 1).
pVDCCIII/3'.1 was then made by splicing together the
HindIII-PvuII fragment from IMR32 1.80 (nucleotides 4493-5296,
SEQ ID No. 1), the PvuII-BglII fragment of IMR32 1.163
(nucleotides 5294 to 6220, SEQ ID No. 1) and the
HindIII-BglII-digested pBluescript plasmid containing the 3'
BglII/XhoI IMR32 1.163 fragment (nt 6221 to 7635, SEQ ID No. 1).

d. pVDCCIII(A): the full-length α1D
construct

To construct pVDCCIII(A), the DraI-HindIII fragment (5'
untranslated sequence nt 330 to 510, SEQ ID No. 1 and coding
sequence nt 511 to 4492, SEQ ID No. 1) of pVDCCIII/5'.3
(Example II.A.3.b.) was isolated; the HindIII-XhoI fragment of
pVDCCIII/3'.1 (containing nt 4493 to 7635, SEQ ID No. 1, plus
the XhoI site of the adapter) (Example II.A.3.c.) was
isolated; and the plasmid vector, pcDNA1, was digested with
EcoRV and XhoI and isolated on an agarose gel. The three DNA
fragments were ligated and MC1061-P3 (Invitrogen, San Diego,
CA) was transformed. Isolated clones were analyzed by
restriction mapping and DNA sequencing and pVDCCIII(A) was
identified which had the fragments correctly ligated together:
DraI-HindIII, HindIII-XhoI, XhoI-EcoRV, with the blunt-end DraI
and EcoRV sites ligating together to form the circular plasmid.

The amino-terminus of the α1D subunit is encoded by the
seven consecutive 5' methionine codons (nt 511 to 531, SEQ ID
No. 1). This 5' portion plus nt 532 to 537,
encoding two lysine residues, were deleted from pVDCCIII(A)
and replaced with an efficient ribosomal binding site
(5'-ACCACC-3') to form pVDCCIII.RBS(A). Expression experiments in
which transcripts of this construct were injected into Xenopus
laevis oocytes did not result in an enhancement in the
recombinant voltage-dependent calcium channel expression level
relative to the level of expression in oocytes injected with
transcripts of pVDCCIII(A).

B. Isolation of DNA encoding the α1C subunit

1. Reference List of Partial α1C cDNA clones

Numerous α1C-specific cDNA clones were isolated in order
to characterize the α1C coding sequence, the α1C initiation of
translation, and an alternatively spliced region of α1C. SEQ
ID No. 3 sets forth one α1C coding sequence (α1C-1) and deduced
amino acid sequence; SEQ ID No. 36 sets forth another splice
variant designated α1C-2. SEQ ID No. 4 and No. 5 encode two
possible amino terminal ends of an α1C splice variant. SEQ ID
No. 6 encodes an alternative exon for the IVS3 transmembrane
domain. Other α1C variants can be constructed by selecting the
alternative amino terminal ends in place of the ends in SEQ ID
No. 3 or 36 and/or inserting the alternative exon (SEQ ID No.
6) in the appropriate location, such as in SEQ ID No. 3 in
place of nucleotides 3904-3987. In addition, the 75
nucleotide sequence (nucleotides 1391-1465 in SEQ ID No. 3)
can be deleted or inserted to produce an alternative α1C splice
variant.

Shown below is a list of clones used to characterize the
α1C sequence and the nucleotide position of each clone relative
to the characterized α1C sequence (SEQ ID No. 3). The
isolation and characterization of these cDNA clones are
described below (Example II.B.2.).

IMR32 1.66      nt 1 to 916, SEQ ID No. 3
                nt 1 to 132, SEQ ID No. 4
IMR32 1.157     nt 1 to 873, SEQ ID No. 3
                nt 1 to 89, SEQ ID No. 5
IMR32 1.67      nt 50 to 1717, SEQ ID No. 3
*IMR32 1.86     nt 1366 to 2583, SEQ ID No. 3
®1.16G          nt 758 to 867, SEQ ID No. 3
IMR32 1.37      nt 2804 to 5904, SEQ ID No. 3
CNS 1.30        nt 2199 to 3903, SEQ ID No. 3
                nt 1 to 84 of alternative exon, SEQ ID No. 6
IMR32 1.38      nt 2448 to 4702, SEQ ID No. 3
                nt 1 to 84 of alternative exon, SEQ ID No. 6

* IMR32 1.86 has a 73 nt deletion compared to the rabbit
cardiac muscle calcium channel α1 subunit cDNA sequence.
® 1.16G is an α1C genomic clone.
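
As a hedged illustration of how the listed clones relate to SEQ ID
No. 3, the sketch below merges the nucleotide ranges from the list
above into contiguous blocks. The ranges are taken from the list; the
function name and data layout are assumptions made for this example.
The check considers only the outer coordinates of each clone and
ignores internal deletions such as the 73 nt deletion in IMR32 1.86.

```python
# Illustrative only: ranges are from the clone list; names are hypothetical.

def merge_ranges(ranges):
    """Merge 1-based inclusive (start, end) ranges that overlap or abut."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# Positions relative to SEQ ID No. 3 (alternative 5' ends and exon omitted)
clone_ranges = {
    "IMR32 1.66": (1, 916),
    "IMR32 1.157": (1, 873),
    "IMR32 1.67": (50, 1717),
    "IMR32 1.86": (1366, 2583),
    "1.16G": (758, 867),
    "IMR32 1.37": (2804, 5904),
    "CNS 1.30": (2199, 3903),
    "IMR32 1.38": (2448, 4702),
}

print(merge_ranges(clone_ranges.values()))
```

A single merged block indicates that the outer coordinates of the
clones overlap end to end across the listed region of SEQ ID No. 3.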

2. Isolation and characterization of
clones described in Example II.B.1.

a. CNS 1.30 

Approximately 1 x 10^6 recombinants of the human thalamus
cDNA library No. 6 (Example I.B.6.) were screened with
fragments of the rabbit skeletal muscle calcium channel α1
cDNA described in Example II.A.2.a. The hybridization was
performed using standard hybridization conditions (Example
I.C.) and the filters were washed under low stringency
(Example I.C.). Six positive plaques were identified, one of
which was CNS 1.30. CNS 1.30 was plaque purified, restriction
mapped, subcloned, and characterized by DNA sequencing. CNS
1.30 encodes α1C-specific sequence nt 2199 to 3903 (SEQ ID No.
3) followed by nt 1 to 84 of one of two identified alternative
α1C exons (SEQ ID No. 6). 3' of SEQ ID No. 6, CNS 1.30
contains an intron and, thus, CNS 1.30 encodes a partially
spliced α1C transcript.

b. 1.16G 

Approximately 1 x 10^6 recombinants of a λEMBL3-based
human genomic DNA library (Cat # HL1006d, Clontech Corp., Palo
Alto, CA) were screened using a rabbit skeletal muscle cDNA
fragment (nt -78 to 1006, Example II.A.2.a.). The
hybridization was performed using standard hybridization
conditions (Example I.C.) and the filters were washed under
low stringency (Example I.C.). Fourteen positive plaques were
identified, one of which was 1.16G. Clone 1.16G was plaque
purified, restriction mapped, subcloned, and portions were
characterized by DNA sequencing. DNA sequencing revealed that
1.16G encodes α1C-specific sequence as described in Example
II.B.1.

c. IMR32 1.66 and IMR32 1.67

Approximately 1 x 10^6 recombinants of IMR32 cDNA library
#5 (Example I.B.5.) were screened with a 151 bp KpnI-SacI
fragment of 1.16G (Example II.B.2.b.) encoding α1C sequence (nt
758 to 867, SEQ ID No. 3). The hybridization was performed
using standard hybridization conditions (Example I.C.). The
filters were then washed in 0.5 x SSPE at 65°C. Of the
positive plaques, IMR32 1.66 and IMR32 1.67 were identified.
The hybridizing plaques were purified, restriction mapped,
subcloned, and characterized by DNA sequencing. Two of these
cDNA clones, IMR32 1.66 and 1.67, encode α1C subunits as
described (Example II.B.1.). In addition, IMR32 1.66 encodes
a partially spliced α1C transcript marked by a GT splice donor
dinucleotide beginning at the nucleotide 3' of nt 916 (SEQ ID
No. 3). The intron sequence within 1.66 is 101 nt long.
IMR32 1.66 encodes the α1C initiation of translation, nt 1 to
3 (SEQ ID No. 3), and 132 nt of 5' untranslated sequence (SEQ
ID No. 4) precede the start codon in IMR32 1.66.

d. IMR32 1.37 and IMR32 1.38

Approximately 2 x 10^6 recombinants of IMR32 cDNA library
#1 (Example I.B.1.) were screened with the CNS 1.30 cDNA
fragment (Example II.B.2.a.). The hybridization was performed
using low stringency hybridization conditions (Example I.C.)
and the filters were washed under low stringency (Example
I.C.). Four positive plaques were identified, plaque
purified, restriction mapped, subcloned, and characterized by
DNA sequencing. Two of the clones, IMR32 1.37 and IMR32 1.38,
encode α1C-specific sequences as described in Example II.B.1.

DNA sequence comparison of IMR32 1.37 and IMR32 1.38
revealed that the α1C transcript includes two exons that encode
the IVS3 transmembrane domain. IMR32 1.37 has a single exon,
nt 3904 to 3987 (SEQ ID No. 3), and IMR32 1.38 appears to be
anomalously spliced to contain both exons juxtaposed, nt 3904
to 3987 (SEQ ID No. 3) followed by nt 1 to 84 (SEQ ID No. 6).
The alternative splice of the α1C transcript to contain either
of the two exons encoding the IVS3 region was confirmed by
comparing the CNS 1.30 sequence to the IMR32 1.37 sequence.
CNS 1.30 contains nt 1 to 84 (SEQ ID No. 6) preceded by the
identical sequence contained in IMR32 1.37 for nt 2199 to 3903
(SEQ ID No. 3). As described in Example II.B.2.a., an intron
follows nt 1 to 84 (SEQ ID No. 6). Two alternative exons have
been spliced adjacent to nt 3903 (SEQ ID No. 3), represented by
CNS 1.30 and IMR32 1.37.

e. IMR32 1.86

IMR32 cDNA library #1 (Example I.B.1.) was screened in
duplicate using oligonucleotide probes 90-9 (nt 1462 to 1491,
SEQ ID No. 3) and 90-12 (nt 2496 to 2520, SEQ ID No. 3).
These oligonucleotide probes were chosen in order to isolate
a clone that encodes the α1C subunit between the 3' end of
IMR32 1.67 (nt 1717, SEQ ID No. 3) and the 5' end of CNS 1.30
(nt 2199, SEQ ID No. 3). The hybridization conditions were
standard hybridization conditions (Example I.C.) with the
exception that the 50% deionized formamide was reduced to 20%.
The filters were washed under low stringency (Example I.C.).
Three positive plaques were identified, one of which was IMR32
1.86. IMR32 1.86 was plaque purified, subcloned, and
characterized by restriction mapping and DNA sequencing.
IMR32 1.86 encodes α1C sequences as described in Example
II.B.1. Characterization by DNA sequencing revealed that
IMR32 1.86 contains a 73 nt deletion compared to the DNA
encoding rabbit cardiac muscle calcium channel α1 subunit
[Mikami et al. (1989) Nature 340:230], nt 2191 to 2263. These
missing nucleotides correspond to nt 2176-2248 of SEQ ID No.
3. Because the 5'-end of CNS 1.30 overlaps the 3'-end of
IMR32 1.86, some of these missing nucleotides, i.e., nt
2205-2248 of SEQ ID No. 3, are accounted for by CNS 1.30. The
remaining missing nucleotides of the 73 nucleotide deletion in
IMR32 1.86 (i.e., nt 2176-2204, SEQ ID No. 3) were determined
by nucleic acid amplification analysis of dbcAMP-induced IMR32
cell RNA. The 73 nt deletion is a frame-shift mutation and,
thus, needs to be corrected. The exact human sequence through
this region (which has been determined by the DNA sequence of
CNS 1.30 and nucleic acid amplification analysis of IMR32 cell
RNA) can be inserted into IMR32 1.86 by standard methods,
e.g., replacement of a restriction fragment or site-directed
mutagenesis.
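
The arithmetic behind the 73 nt deletion can be laid out in a short
sketch. The coordinates are those stated above for SEQ ID No. 3; the
helper and variable names are assumptions introduced for this
illustration. The sketch confirms that the two sources of replacement
sequence together span the deletion and that a deletion of this
length is not a multiple of three, which is consistent with the
frame shift noted in the text.

```python
# Illustrative only: coordinates are from the text; names are hypothetical.

def span_length(start, end):
    """Length of a 1-based inclusive nucleotide range."""
    return end - start + 1

deletion = (2176, 2248)        # missing in IMR32 1.86 (SEQ ID No. 3)
from_cns_1_30 = (2205, 2248)   # portion supplied by CNS 1.30
from_amplified = (2176, 2204)  # portion determined by amplification of IMR32 RNA

deleted_nt = span_length(*deletion)
recovered_nt = span_length(*from_cns_1_30) + span_length(*from_amplified)
shifts_frame = deleted_nt % 3 != 0  # a multiple of 3 would keep the frame

print(deleted_nt, recovered_nt, shifts_frame)
```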

f. IMR32 1.157 

One million recombinants of IMR32 cDNA library #4
(Example I.B.4.) were screened with an XhoI-EcoRI fragment of
IMR32 1.67 encoding α1C nt 50 to 774 (SEQ ID No. 3). The
hybridization was performed using standard hybridization
conditions (Example I.C.). The filters were washed under high
stringency (Example I.C.). One of the positive plaques
identified was IMR32 1.157. This plaque was purified, and the
insert was restriction mapped and subcloned into a standard
plasmid vector, pGEM7Z (Promega, Madison, WI). The DNA was
characterized by sequencing. IMR32 1.157 appears to encode
an alternative 5' portion of the α1C sequence beginning with
nt 1 to 89 (SEQ ID No. 5) and followed by nt 1 to 873 (SEQ ID
No. 3). Analysis of the 1.66 and 1.157 5' sequence is
described below (Example II.B.3.).

3. Characterization of the α1C initiation of
translation site

Portions of the sequences of IMR32 1.157 (nt 57 to 89,
SEQ ID No. 5; nt 1 to 67, SEQ ID No. 3) and IMR32 1.66 (nt 100 to
132, SEQ ID No. 4; nt 1 to 67, SEQ ID No. 3) were compared to
the rabbit lung CaCB-receptor cDNA sequence, nt -33 to 67
[Biel et al. (1990) FEBS Lett. 269:403]. The human sequences
are possible alternative 5' ends of the α1C transcript encoding
the region of initiation of translation. IMR32 1.66 closely
matches the CaCB receptor cDNA sequence and diverges from the
CaCB receptor cDNA sequence in the 5' direction beginning at
nt 122 (SEQ ID No. 4). The start codon identified in the CaCB
receptor cDNA sequence is the same start codon used to
describe the α1C coding sequence, nt 1 to 3 (SEQ ID No. 3).

The sequences of α1C splice variants, designated α1C-1 and
α1C-2, are set forth in SEQ ID Nos. 3 and 36.

C. Isolation of partial cDNA clones encoding the α1B
subunit and construction of a full-length clone

A human basal ganglia cDNA library was screened with the
rabbit skeletal muscle α1 subunit cDNA fragments (see Example
II.A.2.a. for description of fragments) under low stringency
conditions. One of the hybridizing clones was used to screen
an IMR32 cell cDNA library to obtain additional partial α1B
cDNA clones, which were in turn used to further screen an
IMR32 cell cDNA library for additional partial cDNA clones.
One of the partial IMR32 α1B clones was used to screen a human
hippocampus library to obtain a partial α1B clone encoding the
3' end of the α1B coding sequence. The sequence of some of the
regions of the partial cDNA clones was compared to the
sequence of products of nucleic acid amplification analysis of
IMR32 cell RNA to determine the accuracy of the cDNA
sequences.

Nucleic acid amplification analysis of IMR32
cell RNA and genomic DNA using oligonucleotide primers
corresponding to sequences located 5' and 3' of the STOP codon
of the DNA encoding the α1B subunit revealed an alternatively
spliced α1B-encoding mRNA in IMR32 cells. This second mRNA
product is the result of differential splicing of the α1B
subunit transcript to include another exon that is not present
in the mRNA corresponding to the other 3' α1B cDNA sequence
that was initially isolated. To distinguish these splice
variants of the α1B subunit, the subunit encoded by a DNA
sequence corresponding to the form containing the additional
exon is referred to as α1B-1 (SEQ ID No. 7), whereas the subunit
encoded by a DNA sequence corresponding to the form lacking
the additional exon is referred to as α1B-2 (SEQ ID No. 8). The
sequence of α1B-1 diverges from that of α1B-2 beginning at nt
6633 (SEQ ID No. 7). Following the sequence of the additional
exon in α1B-1 (nt 6633-6819; SEQ ID No. 7), the α1B-1 and α1B-2
sequences are identical (i.e., nt 6820-7362 in SEQ ID No. 7
and nt 6633-7175 in SEQ ID No. 8). SEQ ID No. 7 and No. 8 set
forth 143 nt of 5' untranslated sequence (nt 1-143) as well as
202 nt of 3' untranslated sequence (nt 7161-7362, SEQ ID No.
7) of the DNA encoding α1B-1 and 321 nt of 3' untranslated
sequence (nt 6855-7175, SEQ ID No. 8) of the DNA encoding α1B-2.
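
The relationship between the two numbering schemes can be expressed
as a small coordinate mapping. In the sketch below the exon
boundaries are those stated above for SEQ ID No. 7; the function name
and the use of None for exon-only positions are assumptions made for
illustration, not part of the disclosure.

```python
# Illustrative only: exon coordinates are from the text; names are hypothetical.

EXON_START, EXON_END = 6633, 6819          # additional exon in SEQ ID No. 7
EXON_LENGTH = EXON_END - EXON_START + 1    # nucleotides absent from SEQ ID No. 8

def seq7_to_seq8(nt):
    """Map a SEQ ID No. 7 position to SEQ ID No. 8, or None inside the exon."""
    if nt < EXON_START:
        return nt                  # sequences are identical 5' of the exon
    if nt <= EXON_END:
        return None                # position exists only in the exon-containing form
    return nt - EXON_LENGTH        # 3' positions shift by the exon length

print(EXON_LENGTH, seq7_to_seq8(6820), seq7_to_seq8(7362))
```

Under these assumptions the mapping reproduces the stated
correspondence of nt 6820-7362 (SEQ ID No. 7) with nt 6633-7175
(SEQ ID No. 8).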



Nucleic acid amplification analysis of the IS6
region of the α1B transcript revealed what appear to be
additional splice variants based on multiple fragment sizes
seen on an ethidium bromide-stained agarose gel containing the
products of the amplification reaction.

A full-length α1B-1 cDNA clone designated pcDNA-α1B-1 was
prepared in an eight-step process as follows.

STEP 1: The SacI restriction site of pGEM3 (Promega,
Madison, WI) was destroyed by digestion at the SacI
site, producing blunt ends by treatment with T4 DNA
polymerase, and religation. The new vector was
designated pGEMΔSac.

STEP 2: Fragment 1 (HindIII/KpnI; nt 2337 to 4303 of SEQ ID
No. 7) was ligated into HindIII/KpnI digested
pGEM3ΔSac to produce pα1.177HK.

STEP 3: Fragment 1 has a 2 nucleotide deletion (nt 3852 and
3853 of SEQ ID No. 7). The deletion was repaired
by inserting an amplified fragment (fragment 2) of
IMR32 RNA into pα1.177HK. Thus, fragment 2
(NarI/KpnI; nt 3828 to 4303 of SEQ ID No. 7) was
inserted into NarI/KpnI digested pα1.177HK,
replacing the NarI/KpnI portion of fragment 1 and
producing pα1.177HK/PCR.

STEP 4: Fragment 3 (KpnI/KpnI; nt 4303 to 5663 of SEQ ID
No. 7) was ligated into KpnI digested pα1.177HK/PCR
to produce pα1B5'K.

STEP 5: Fragment 4 (EcoRI/HindIII; EcoRI adaptor plus nt 1
to 2337 of SEQ ID No. 7) and fragment 5
(HindIII/XhoI fragment of pα1B5'K; nt 2337 to 5446
of SEQ ID No. 7) were ligated together into
EcoRI/XhoI digested pcDNA1 (Invitrogen, San Diego,
CA) to produce pα1B5'.

STEP 6: Fragment 6 (EcoRI/EcoRI; EcoRI adapters on both
ends plus nt 5749 to 7362 of SEQ ID No. 7) was
ligated into EcoRI digested pBluescript II KS
(Stratagene, La Jolla, CA) with the 5' end of the
fragment proximal to the KpnI site in the
polylinker to produce pα1.230.

STEP 7: Fragment 7 (KpnI/XhoI; nt 4303 to 5446 of SEQ ID
No. 7) and fragment 8 (XhoI/CspI; nt 5446 to 6259
of SEQ ID No. 7) were ligated into KpnI/CspI
digested pα1.230 (removes nt 5749 to 6259 of SEQ ID
No. 7 that was encoded in pα1.230 and maintains nt
6259 to 7362 of SEQ ID No. 7) to produce pα1B3'.

STEP 8: Fragment 9 (SphI/XhoI; nt 4993 to 5446 of SEQ ID
No. 7) and fragment 10 (XhoI/XbaI of pα1B3'; nt
5446 to 7319 of SEQ ID No. 7) were ligated into
SphI/XbaI digested pα1B5' (removes nt 4993 to 5446
of SEQ ID No. 7 that were encoded in pα1B5' and
maintains nt 1 to 4850 of SEQ ID No. 7) to produce
pcDNAα1B-1.

The resulting construct, pcDNAα1B-1, contains, in pcDNA1,
a full-length coding region encoding α1B-1 (nt 144-7362, SEQ ID
No. 7), plus 5' untranslated sequence (nt 1-143, SEQ ID No. 7)
and 3' untranslated sequence (nt 7161-7319, SEQ ID No. 7)
under the transcriptional control of the CMV promoter.

D. Isolation of DNA encoding human calcium channel α1A
subunits

1. Isolation of partial clones

DNA clones encoding portions of human calcium channel α1A
subunits were obtained by hybridization screening of human
cerebellum cDNA libraries and nucleic acid amplification of
human cerebellum RNA. Clones corresponding to the 3' end of
the α1A coding sequence were isolated by screening 1 x 10^6
recombinants of a randomly primed cerebellum cDNA library
(size-selected for inserts greater than 2.8 kb in length)
under low stringency conditions (6X SSPE, 5X Denhardt's
solution, 0.2% SDS, 200 µg/ml sonicated herring sperm DNA,
42°C) with oligonucleotide 704 containing nt 6190-6217 of the
rat α1A coding sequence [Starr et al. (1992) Proc. Natl. Acad.
Sci. U.S.A. 88:5621-5625]. Washes were performed under low
stringency conditions. Several clones that hybridized to the
probe (clones α1.251-α1.259 and α1.244) were purified and
characterized by restriction enzyme mapping and DNA sequence
analysis. At least two of the clones, α1.244 and α1.254,
contained a translation termination codon. Although clones
α1.244 and α1.254 are different lengths, they both contain a
sequence of nucleotides that corresponds to the extreme 3' end
of the α1A transcript, i.e., the two clones overlap. These two
clones are identical in the region of overlap, except that clone
α1.244 contains a sequence of 5 and a sequence of 12
nucleotides that are not present in α1.254.

To obtain additional α1A-encoding clones, 1 x 10^6
recombinants of a randomly primed cerebellum cDNA library
(size-selected for inserts ranging from 1.0 to 2.8 kb in
length) were screened for hybridization to three
oligonucleotides: oligonucleotide 701 (containing nucleotides
2288-2315 of the rat α1A coding sequence), oligonucleotide 702
(containing nucleotides 3559-3585 of the rat α1A coding
sequence) and oligonucleotide 703 (containing nucleotides
4798-4827 of the rat α1A coding sequence). Hybridization and
washes were performed using the same conditions as used for
the first screening with oligonucleotide 704, except that
washes were conducted at 45°C. Twenty clones (clones
α1.269-α1.288) hybridized to the probe. Several clones were
plaque-purified and characterized by restriction enzyme mapping
and DNA sequence analysis. One clone, α1.279, contained a
sequence of about 170 nucleotides that is not present in other
clones corresponding to the same region of the coding
sequence. This region may be present in other splice
variants. None of the clones contained a translation
initiation codon.

To obtain clones corresponding to the 5' end of the human
α1A coding sequence, another cerebellum cDNA library was
prepared using oligonucleotide 720 (containing nucleotides
2485-2510 of SEQ ID No. 22) to specifically prime first-strand
cDNA synthesis. The library (8 x 10^5 recombinants) was
screened for hybridization to three oligonucleotides:
oligonucleotide 701, oligonucleotide 726 (containing
nucleotides 2333-2360 of the rat α1A coding sequence) and
oligonucleotide 700 (containing nucleotides 767-796 of the rat
α1A coding sequence) under low stringency hybridization and
washing conditions. Approximately 50 plaques hybridized to
the probe. Hybridizing clones α1.381-α1.390 were
plaque-purified and characterized by restriction enzyme mapping
and DNA sequence analysis. At least one of the clones, α1.381,
contained a translation initiation codon.

Alignment of the sequences of the purified clones
revealed that the sequences overlapped to comprise the entire
α1A coding sequence. However, not all the overlapping
sequences of partial clones contained convenient restriction
enzyme sites for use in ligating partial clones to
construct a full-length α1A coding sequence. To obtain DNA
fragments containing convenient restriction enzyme sites that
could be used in constructing a full-length α1A DNA, cDNA was
synthesized from RNA isolated from human cerebellum tissue and
subjected to nucleic acid amplification. The oligonucleotides
used as primers corresponded to human α1A coding sequence
located 5' and 3' of selected restriction enzyme sites. Thus,
in the first amplification reaction, oligonucleotides 753
(containing nucleotides 2368-2391 of SEQ ID No. 22) and 728
(containing nucleotides 3179-3202 of SEQ ID No. 22) were used
as the primer pair. To provide a sufficient amount of the
desired DNA fragment, the product of this amplification was
reamplified using oligonucleotides 753 and 754 (containing
nucleotides 3112-3135 of SEQ ID No. 22) as the primer pair.
The resulting product was 768 bp in length. In the second
amplification reaction, oligonucleotides 719 (containing
nucleotides 4950-4975 of SEQ ID No. 22) and 752 (containing
nucleotides 5647-5670 of SEQ ID No. 22) were used as the
primer pair. To provide a sufficient amount of the desired
second DNA fragment, the product of this amplification was
reamplified using oligonucleotides 756 (containing nucleotides
5112-5135 of SEQ ID No. 22) and 752 as the primer pair. The
resulting product was 559 bp in length.
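
The stated product sizes follow from the primer coordinates, as the
sketch below illustrates. The primer positions are those given above
for SEQ ID No. 22; the dictionary layout and function name are
assumptions for this example, and the calculation assumes that each
downstream primer anneals to the listed span so that the product
runs from the first nucleotide of the upstream primer to the last
nucleotide of the downstream primer.

```python
# Illustrative only: primer spans are from the text; names are hypothetical.

# oligonucleotide number -> (first nt, last nt) in SEQ ID No. 22
primers = {
    753: (2368, 2391),
    728: (3179, 3202),
    754: (3112, 3135),
    719: (4950, 4975),
    752: (5647, 5670),
    756: (5112, 5135),
}

def amplicon_length(upstream, downstream):
    """Product length from the 5' end of one primer span to the 3' end of the other."""
    return primers[downstream][1] - primers[upstream][0] + 1

print(amplicon_length(753, 754))  # reamplified product of the first reaction
print(amplicon_length(756, 752))  # reamplified product of the second reaction
```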

2. Construction of full-length α1A coding sequences

Portions of clone α1.381, the 768-bp nucleic acid
amplification product, clone α1.278, the 559-bp nucleic acid
amplification product, and clone α1.244 were ligated at
convenient restriction sites to generate a full-length α1A
coding sequence referred to as α1A-1.

Comparison of the results of sequence analysis of clones
α1.244 and α1.254 indicated that the primary transcript of the
α1A subunit gene is alternatively spliced to yield at least two
variant mRNAs encoding different forms of the α1A subunit. One
form, α1A-1, is encoded by the sequence shown in SEQ ID No. 22.
The sequence encoding a second form, α1A-2, differs from the
α1A-1-encoding sequence at the 3' end in that it lacks a 5-nt
sequence found in clone α1.244 (nucleotides 7035-7039 of SEQ
ID No. 22). This deletion shifts the reading frame and
introduces a translation termination codon, resulting in an α1A-2
coding sequence that encodes a shorter α1A subunit than that
encoded by the α1A-1 splice variant. Consequently, a portion
of the 3' end of the α1A-1 coding sequence is actually 3'
untranslated sequence in the α1A-2 DNA. The complete sequence
of α1A-2, which can be constructed by ligating portions of clone
α1.381, the 768-bp nucleic acid amplification product, clone
α1.278, the 559-bp nucleic acid amplification product and
clone α1.254, is set forth in SEQ ID No. 23.

E. Isolation of DNA Encoding the α1E Subunit

DNA encoding α1E subunits of the human calcium channel
was isolated from human hippocampus libraries. The selected
clones were sequenced. DNA sequence analysis of DNA clones
encoding the α1E subunit indicated that at least two
alternatively spliced forms of the same α1E subunit primary
transcript are expressed. One form has the sequence set forth
in SEQ ID No. 24 and was designated α1E-1, and the other was
designated α1E-3, which has the sequence obtained by inserting
a 57 base pair fragment between nucleotides 2405 and 2406 of
SEQ ID No. 24. The resulting sequence is set forth in SEQ ID
No. 25.

The subunit designated α1E-1 has a calculated molecular
weight of 254,836 and the subunit designated α1E-3 has a
calculated molecular weight of 257,348. α1E-3 has a 19 amino
acid insertion (encoded by SEQ ID No. 25) relative to α1E-1 in
the region that appears to be the cytoplasmic loop between
transmembrane domains IIS6 and IIIS1.
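
The figures given for the two α1E forms are mutually consistent, as a
brief calculation shows. The numbers below are taken from the text;
the variable names and the per-residue average are illustrative
additions and are not stated in the source.

```python
# Illustrative only: input values are from the text; names are hypothetical.

INSERT_BP = 57                 # fragment inserted between nt 2405 and 2406
MW_SHORT_FORM = 254_836        # calculated molecular weight without the insert
MW_LONG_FORM = 257_348         # calculated molecular weight with the insert

in_frame = INSERT_BP % 3 == 0             # whole codons preserve the reading frame
inserted_residues = INSERT_BP // 3        # amino acids encoded by the insert
mass_difference = MW_LONG_FORM - MW_SHORT_FORM
mean_residue_mass = mass_difference / inserted_residues

print(in_frame, inserted_residues, mass_difference, round(mean_residue_mass, 1))
```

An insert that is a whole number of codons adds residues without
shifting the frame, which matches the 19 amino acid insertion
described above.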

EXAMPLE III: ISOLATION OF cDNA CLONES ENCODING THE HUMAN
NEURONAL CALCIUM CHANNEL β1 subunit

A. Isolation of partial cDNA clones encoding the β
subunit and construction of a full-length clone
encoding the β1 subunit

A human hippocampus cDNA library was screened with the
rabbit skeletal muscle calcium channel β1 subunit cDNA
fragment (nt 441 to 1379) [for isolation and sequence of the
rabbit skeletal muscle calcium channel β1 subunit cDNA, see
U.S. Patent Application Serial No. 482,384 or Ruth et al.
(1989) Science 245:1115] using standard hybridization
conditions (Example I.C.). A portion of one of the
hybridizing clones was used to rescreen the hippocampus
library to obtain additional cDNA clones. The cDNA inserts of
hybridizing clones were characterized by restriction mapping
and DNA sequencing and compared to the rabbit skeletal muscle
calcium channel β1 subunit cDNA sequence.

Portions of the partial β1 subunit cDNA clones were
ligated to generate a full-length clone encoding the entire β1
subunit. SEQ ID No. 9 shows the β1 subunit coding sequence
(nt 1-1434) as well as a portion of the 3' untranslated
sequence (nt 1435-1546). The deduced amino acid sequence is
also provided in SEQ ID No. 9. In order to perform expression
experiments, full-length β1 subunit cDNA clones were
constructed as follows.

Step 1: DNA fragment 1 (~800 bp of 5' untranslated
sequence plus nt 1-277 of SEQ ID No. 9) was ligated to DNA
fragment 2 (nt 277-1546 of SEQ ID No. 9 plus 448 bp of intron
sequence) and cloned into pGEM7Z. The resulting plasmid,
pβ1-1.18, contained a full-length β1 subunit clone that included
a 448-bp intron.

Step 2: To replace the 5' untranslated sequence of
pβ1-1.18 with a ribosome binding site, a double-stranded adapter
was synthesized that contains an EcoRI site, sequence encoding
a ribosome binding site (5'-ACCACC-3') and nt 1-25 of SEQ ID
No. 9. The adapter was ligated to SmaI-digested pβ1-1.18, and
the products of the ligation reaction were digested with
EcoRI.

Step 3: The EcoRI fragment from step 2 containing the
EcoRI adapter, efficient ribosome binding site and nt 1-1546
of SEQ ID No. 9 plus intron sequence was cloned into a plasmid
vector and designated pβ1-1.18RBS. The EcoRI fragment of
pβ1-1.18RBS was subcloned into EcoRI-digested pcDNA1 with the
initiation codon proximal to the CMV promoter to form
pHBCaCHβ1aRBS(A).

Step 4: To generate a full-length clone encoding the β1
subunit lacking intron sequence, DNA fragment 3 (nt 69-1146 of
SEQ ID No. 9 plus 448 bp of intron sequence followed by nt
1147-1546 of SEQ ID No. 9) was subjected to site-directed
mutagenesis to delete the intron sequence, thereby yielding
pβ1(-). The EcoRI-XhoI fragment of pβ1-1.18RBS (containing
the ribosome binding site and nt 1-277 of SEQ ID No. 9) was
ligated to the XhoI-EcoRI fragment of pβ1(-) (containing nt
277-1546 of SEQ ID No. 9) and cloned into pcDNA1 with the
initiation of translation proximal to the CMV promoter. The
resulting expression plasmid was designated pHBCaCHβ1bRBS(A).

B. Splice Variant β1-3

DNA sequence analysis of the DNA clones encoding the β1
subunit indicated that in the CNS at least two alternatively
spliced forms of the same human β1 subunit primary transcript
are expressed. One form is represented by the sequence shown
in SEQ ID No. 9 and is referred to as β1-2. The sequences of
β1-2 and the alternative form, β1-3, diverge at nt 1334 (SEQ
ID No. 9). The complete β1-3 sequence (nt 1-1851), including
3' untranslated sequence (nt 1795-1851), is set forth in SEQ
ID No. 10.

EXAMPLE IV: ISOLATION OF cDNA CLONES ENCODING THE HUMAN

NEURONAL CALCIUM CHANNEL α2 SUBUNIT

A. Isolation of cDNA clones 

The complete human neuronal α2 coding sequence (nt 35-3310)
plus a portion of the 5' untranslated sequence (nt 1 to
34) as well as a portion of the 3' untranslated sequence (nt
3311-3600) is set forth in SEQ ID No. 11.

To isolate DNA encoding the human neuronal α2 subunit,
human α2 genomic clones first were isolated by probing human
genomic Southern blots using a rabbit skeletal muscle calcium
channel α2 subunit cDNA fragment [nt 43 to 272, Ellis et al.
(1988) Science 240:1661]. Human genomic DNA was digested with
EcoRI, electrophoresed, blotted, and probed with the rabbit
skeletal muscle probe using standard hybridization conditions
(Example I.C.) and low stringency washing conditions (Example
I.C.). Two restriction fragments were identified, 3.5 kb and
3.0 kb. These EcoRI restriction fragments were cloned by
preparing a λgt11 library containing human genomic EcoRI
fragments ranging from 2.2 kb to 4.3 kb. The library was
screened as described above using the rabbit α2 probe;
hybridizing clones were isolated and characterized by DNA
sequencing. HGCaCHα2.20 contained the 3.5 kb fragment and
HGCaCHα2.9 contained the 3.0 kb fragment.

Restriction mapping and DNA sequencing revealed that
HGCaCHα2.20 contains an 82 bp exon (nt 130 to 211 of the human
α2 coding sequence, SEQ ID No. 11) on a 650 bp PstI-XbaI
restriction fragment and that HGCaCHα2.9 contains 105 bp of an
exon (nt 212 to 316 of the coding sequence, SEQ ID No. 11) on
a 750 bp XbaI-BglII restriction fragment. These restriction
fragments were used to screen the human basal ganglia cDNA
library (Example II.C.2.a.). HBCaCHα2.1 was isolated (nt 29
to 1163, SEQ ID No. 11) and used to screen a human brain stem
cDNA library (ATCC Accession No. 37432) obtained from the
American Type Culture Collection, 12301 Parklawn Drive,
Rockville, MD 20852. Two clones were isolated, HBCaCHα2.5
(nt 1 to 1162, SEQ ID No. 11) and HBCaCHα2.8 (nt 714 to 1562,
SEQ ID No. 11, followed by 1600 nt of intervening sequence).
A 2400 bp fragment of HBCaCHα2.8 (beginning at nt 759 of SEQ
ID No. 11 and ending at a SmaI site in the intron) was used to
rescreen the brain stem library and to isolate HBCaCHα2.11 (nt
879 to 3600, SEQ ID No. 11). Clones HBCaCHα2.5 and
HBCaCHα2.11 overlap to encode an entire human brain α2
protein.
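
The statement that two clones overlap to encode the entire protein can be checked from the coordinates given above. The following is a minimal sketch, not part of the described procedure; the function name `covers` and the dictionary layout are illustrative assumptions, and coordinates are treated as 1-based and inclusive.

```python
def covers(target, clones):
    """Return True if the union of clone intervals spans the target interval.

    Intervals are (first_nt, last_nt), 1-based and inclusive.
    """
    start, end = target
    reach = start - 1  # last target position covered so far
    for lo, hi in sorted(clones):
        if lo > reach + 1:
            break  # a gap precedes this clone, so coverage is interrupted
        reach = max(reach, hi)
        if reach >= end:
            return True
    return reach >= end


# Coordinates relative to SEQ ID No. 11, as quoted in the text.
clones = {
    "HBCaCHα2.5": (1, 1162),
    "HBCaCHα2.11": (879, 3600),
}
coding = (35, 3310)

# Length of the region shared by the two clones.
overlap = (
    min(hi for _, hi in clones.values())
    - max(lo for lo, _ in clones.values())
    + 1
)

print("coding sequence covered:", covers(coding, clones.values()))
print("overlap between clones (nt):", overlap)
```

The same check applied to HBCaCHα2.5 together with HBCaCHα2.8 (nt 714 to 1562) reports incomplete coverage, which is consistent with the need for the rescreening step.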

B. Construction of pHBCaCHα2A

To construct pHBCaCHα2A containing DNA encoding a full-length
human calcium channel α2 subunit, an (EcoRI)-PvuII
fragment of HBCaCHα2.5 (nt 1 to 1061, SEQ ID No. 11; EcoRI
adapter, PvuII partial digest) and a PvuII-PstI fragment of
HBCaCHα2.11 (nt 1061 to 2424, SEQ ID No. 11; PvuII partial
digest) were ligated into EcoRI-PstI-digested pIBI24
(Stratagene, La Jolla, CA). Subsequently, an (EcoRI)-PstI
fragment (nt 1 to 2424, SEQ ID No. 11) was isolated and ligated
to a PstI-(EcoRI) fragment (nt 2424 to 3600, SEQ ID No. 11) of
HBCaCHα2.11 in EcoRI-digested pIBI24 to produce DNA, HBCaCHα2,
encoding a full-length human brain α2 subunit. The 3600 bp
EcoRI insert of HBCaCHα2 (nt 1 to 3600, SEQ ID No. 11) was
subcloned into pcDNA1 (pHBCaCHα2A) with the methionine
initiating codon proximal to the CMV promoter. The 3600 bp
EcoRI insert of HBCaCHα2 was also subcloned into pSV2dHFR
[Subramani et al. (1981) Mol. Cell. Biol. 1:854-864], which
contains the SV40 early promoter, mouse dihydrofolate
reductase (dhfr) gene, SV40 polyadenylation and splice sites
and sequences required for maintenance of the vector in
bacteria.
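
The fragment coordinates in the construction above can be checked for consistency. This sketch assumes that adjacent fragments are joined at the nucleotide position they share (the PvuII site at nt 1061 and the PstI site at nt 2424). The helper `check_assembly` is hypothetical and only verifies the bookkeeping, not the cloning itself.

```python
# (description, first nt, last nt) relative to SEQ ID No. 11, as quoted in the text.
fragments = [
    ("(EcoRI)-PvuII fragment of HBCaCHα2.5", 1, 1061),
    ("PvuII-PstI fragment of HBCaCHα2.11", 1061, 2424),
    ("PstI-(EcoRI) fragment of HBCaCHα2.11", 2424, 3600),
]


def check_assembly(frags):
    """Return (start, end) of the assembled insert.

    Raises ValueError if two consecutive fragments do not end and begin
    at the same junction coordinate.
    """
    for (name_a, _, end_a), (name_b, start_b, _) in zip(frags, frags[1:]):
        if end_a != start_b:
            raise ValueError(f"{name_a} and {name_b} do not share a junction")
    return frags[0][1], frags[-1][2]


span = check_assembly(fragments)
print("assembled insert spans nt %d to %d" % span)
```

The resulting span, nt 1 to 3600, agrees with the 3600 bp EcoRI insert described for HBCaCHα2.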




EXAMPLE V: DIFFERENTIAL PROCESSING OF THE HUMAN β1

TRANSCRIPT AND THE HUMAN α2 TRANSCRIPT

A. Differential processing of the β1 transcript

Nucleic acid amplification analysis of the human β1
transcript present in skeletal muscle, aorta, hippocampus and
basal ganglia, and HEK 293 cells revealed differential
processing of the region corresponding to nt 615-781 of SEQ ID
No. 9 in each of the tissues. Four different sequences that
result in five different processed β1 transcripts through this
region were identified. The β1 transcripts from the different
tissues contained different combinations of the four
sequences, except for one of the β1 transcripts expressed in
HEK 293 cells (β1-5) which lacked all four sequences.

None of the β1 transcripts contained each of the four
sequences; however, for ease of reference, all four sequences
are set forth end-to-end as a single long sequence in SEQ ID
No. 12. The four sequences that are differentially processed
are sequence 1 (nt 14-34 in SEQ ID No. 12), sequence 2 (nt
35-55 in SEQ ID No. 12), sequence 3 (nt 56-190 in SEQ ID No. 12)
and sequence 4 (nt 191-271 in SEQ ID No. 12). The forms of
the β1 transcript that have been identified include: (1) a
form that lacks sequence 1 called β1-1 (expressed in skeletal
muscle), (2) a form that lacks sequences 2 and 3 called β1-2
(expressed in CNS), (3) a form that lacks sequences 1, 2 and 3
called β1-4 (expressed in aorta and HEK cells) and (4) a form
that lacks sequences 1-4 called β1-5 (expressed in HEK cells).
Additionally, the β1-4 and β1-5 forms contain a guanine
nucleotide (nt 13 in SEQ ID No. 12) that is absent in the β1-1
and β1-2 forms. The sequences of β1 splice variants are set
forth in SEQ ID Nos. 9, 10 and 33-35.
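
The combinations listed above can be tabulated directly from the SEQ ID No. 12 coordinates. The sketch below is illustrative only. The variant labels are written as β1-1, β1-2, β1-4 and β1-5, and the dictionary names are assumptions. It also reports whether the total length removed from each form is a multiple of three. The additional guanine at nt 13 is not modeled.

```python
# Differentially processed sequences, as (first nt, last nt) in SEQ ID No. 12.
SEGMENTS = {1: (14, 34), 2: (35, 55), 3: (56, 190), 4: (191, 271)}

# Sequences absent from each identified form of the transcript.
MISSING = {
    "β1-1": {1},
    "β1-2": {2, 3},
    "β1-4": {1, 2, 3},
    "β1-5": {1, 2, 3, 4},
}


def seg_len(n):
    """Length in nucleotides of differentially processed sequence n."""
    first, last = SEGMENTS[n]
    return last - first + 1


removed_nt = {}
for name, missing in MISSING.items():
    retained = sorted(set(SEGMENTS) - missing)
    removed_nt[name] = sum(seg_len(n) for n in missing)
    in_frame = removed_nt[name] % 3 == 0
    print(f"{name}: retains {retained}, lacks {removed_nt[name]} nt, "
          f"multiple of three: {in_frame}")
```

Each of the four sequences has a length divisible by three (21, 21, 135 and 81 nt), so omitting any combination of them does not by itself change the reading frame of the remaining sequence.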

B. Differential processing of transcripts encoding the
α2 subunit

The complete human neuronal α2 coding sequence (nt 35-3307)
plus a portion of the 5' untranslated sequence (nt 1 to
34) as well as a portion of the 3' untranslated sequence (nt
3308-3600) is set forth as SEQ ID No. 11.

Nucleic acid amplification analysis of the human α2
transcript present in skeletal muscle, aorta, and CNS revealed
differential processing of the region corresponding to nt
1595-1942 of SEQ ID No. 11 in each of the tissues.

The analysis indicated that the primary transcript
of the genomic DNA that includes the nucleotides corresponding
to nt 1595-1942 also includes an additional sequence (SEQ ID
No. 13: 5' CCTATTGGTGTAGGTATACCAACAATTAATTT
AAGAAAAAGGAGACCCAATATCCAG 3') inserted between nt 1624 and
1625 of SEQ ID No. 11. Five alternatively spliced variant
transcripts that differ in the presence or absence of one to
three different portions of the region of the primary
transcript that includes the region of nt 1595-1942 of SEQ ID
No. 11 plus SEQ ID No. 13 inserted between nt 1624 and 1625
have been identified. The five α2-encoding transcripts from
the different tissues include different combinations of the
three sequences, except for one of the α2 transcripts
expressed in aorta which lacks all three sequences. None of
the α2 transcripts contained each of the three sequences. The
sequences of the three regions that are differentially
processed are sequence 1 (SEQ ID No. 13), sequence 2 (5'
AACCCCAAATCTCAG 3', which is nt 1625-1639 of SEQ ID No. 11),
and sequence 3 (5' CAAAAAAGGGCAAAATGAAGG 3', which is nt
1908-1928 of SEQ ID No. 11). The five α2 forms identified are
(1) a form that lacks sequence 3 called α2a (expressed in
skeletal muscle), (2) a form that lacks sequence 1 called α2b
(expressed in CNS), (3) a form that lacks sequences 1 and 2
called α2c (expressed in aorta), (4) a form that lacks
sequences 1, 2 and 3 called α2d (expressed in aorta) and (5)
a form that lacks sequences 1 and 3 called α2e (expressed in
aorta).
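
As a consistency check on the three differentially processed sequences, the quoted nucleotide strings can be compared with the coordinate ranges given for them. This is a sketch under the assumption that coordinates are 1-based and inclusive. The names `SEQUENCES`, `COORDS` and `MISSING` are illustrative and do not come from the source.

```python
# Nucleotide strings exactly as quoted in the text.
SEQUENCES = {
    1: "CCTATTGGTGTAGGTATACCAACAATTAATTT" "AAGAAAAAGGAGACCCAATATCCAG",  # SEQ ID No. 13
    2: "AACCCCAAATCTCAG",
    3: "CAAAAAAGGGCAAAATGAAGG",
}

# Positions in SEQ ID No. 11; sequence 1 is an insertion and has no range there.
COORDS = {2: (1625, 1639), 3: (1908, 1928)}

# Sequences absent from each identified α2 form.
MISSING = {
    "α2a": {3},
    "α2b": {1},
    "α2c": {1, 2},
    "α2d": {1, 2, 3},
    "α2e": {1, 3},
}

# The length of each quoted string should equal the length of its range.
for n, (first, last) in COORDS.items():
    span = last - first + 1
    print(f"sequence {n}: {len(SEQUENCES[n])} nt quoted, {span} nt by coordinates")

lacking_nt = {}
for name, missing in MISSING.items():
    retained = sorted(set(SEQUENCES) - missing)
    lacking_nt[name] = sum(len(SEQUENCES[n]) for n in missing)
    print(f"{name}: retains {retained}, lacks {lacking_nt[name]} nt")
```

The three sequences are 57, 15 and 21 nt long, each a multiple of three, so their presence or absence changes the length of the encoded protein without shifting the reading frame of the flanking sequence.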

The sequences of α2a-α2e are set forth in SEQ ID Nos.
29-32, respectively.

EXAMPLE VI: ISOLATION OF DNA ENCODING A CALCIUM CHANNEL γ

SUBUNIT FROM A HUMAN BRAIN cDNA LIBRARY

A. Isolation of DNA encoding the γ subunit

Approximately 1 x 10^6 recombinants from a λgt11-based
human hippocampus cDNA library (Clontech catalog #HL1088b,
Palo Alto, CA) were screened by hybridization to a 484 bp
sequence of the rabbit skeletal muscle calcium channel γ
subunit cDNA (nucleotides 621-626 of the coding sequence plus
438 nucleotides of 3'-untranslated sequence) contained in
vector γJ10 [Jay, S. et al. (1990) Science 248:490-492].
Hybridization was performed using moderate stringency
conditions (20% deionized formamide, 5x Denhardt's, 6x SSPE,
0.2% SDS, 20 μg/ml herring sperm DNA, 42°C) and the filters
were washed under low stringency (see Example I.C.). A plaque
that hybridized to this probe was purified and insert DNA was
subcloned into pGEM7Z. This cDNA insert was designated γ1.4.

B. Characterization of γ1.4

γ1.4 was confirmed by DNA hybridization and characterized
by DNA sequencing. The 1500 bp SstI fragment of γ1.4
hybridized to the rabbit skeletal muscle calcium channel γ
subunit cDNA γJ10 on a Southern blot. Sequence analysis of this
fragment revealed that it contains approximately 500 nt of
human DNA sequence and ~1000 nt of λgt11 sequence (included
due to apparent destruction of one of the EcoRI cloning sites
in λgt11). The human DNA sequence contains 129 nt of
coding sequence followed immediately by a translational STOP
codon and 3' untranslated sequence (SEQ ID No. 14).

To isolate the remaining 5' sequence of the human γ
subunit cDNA, human CNS cDNA libraries and/or preparations of
mRNA from human CNS tissues can first be assayed by nucleic
acid amplification analysis methods using oligonucleotide
primers based on the γ cDNA-specific sequence of γ1.4.
Additional human neuronal γ subunit-encoding DNA can be
isolated from cDNA libraries that, based on the results of the
nucleic acid amplification analysis assay, contain γ-specific
amplifiable cDNA. Alternatively, cDNA libraries can be
constructed from mRNA preparations that, based on the results
of the nucleic acid amplification analysis assays, contain
γ-specific amplifiable transcripts. Such libraries are
constructed by standard methods using oligo dT to prime
first-strand cDNA synthesis from poly A+ RNA (see Example I.B.).
Alternatively, first-strand cDNA can be specified by priming
first-strand cDNA synthesis with a γ cDNA-specific
oligonucleotide based on the human DNA sequence in γ1.4. A
cDNA library can then be constructed based on this
first-strand synthesis and screened with the γ-specific portion
of γ1.4.

EXAMPLE VII: ISOLATION OF cDNA CLONES ENCODING THE HUMAN

NEURONAL Ca CHANNEL β2 SUBUNIT

Isolation of DNA Encoding human calcium channel β2
subunits

Sequencing of clones isolated as described in Example III
revealed a clone encoding a human neuronal calcium channel β2
subunit (designated β2D; see SEQ ID No. 26). An
oligonucleotide based on the 5' end of this clone was used to
prime a human hippocampus cDNA library. The library was
screened with this β2 clone under conditions of low to medium
stringency (final wash 0.5x SSPE, 50°C). Several
hybridizing clones were isolated and sequenced. Among these
clones were those that encode β2C, β2D and β2E. For example,
the sequence of β2C is set forth in SEQ ID No. 37, and the
sequence of β2E is set forth in SEQ ID No. 38.

A randomly primed hippocampus library was then screened
using a combination of the clone encoding β2D and a portion of
the β3 clone deposited under ATCC Accession No. 69048.
Multiple hybridizing clones were isolated. Among these were
clones designated β101, β102 and β104. β101 appears to
encode the 5' end of a splice variant of β2, designated β2E.
β102 and β104 encode portions of the 3' end of β2.

It appears that the β2 splice variants include
nucleotides 182-2294 of SEQ ID No. 26 and differ only between
the start codon and nucleotides that correspond to 212 of SEQ
ID No. 26.

EXAMPLE VIII: ISOLATION OF cDNA CLONES ENCODING HUMAN

CALCIUM CHANNEL β4 AND β3 SUBUNITS

A. Isolation of cDNA Clones Encoding a Human β4 Subunit

A clone containing a translation initiation codon and
approximately 60% of the β4 coding sequence was obtained from
a human cerebellum cDNA library (see nucleotides 1-894 of
Sequence ID No. 27). To obtain DNA encoding the remaining 3'
portion of the coding sequence, a human cerebellum cDNA
library was screened for hybridization to a nucleic acid
amplification product under high stringency hybridization and
wash conditions. Hybridizing clones are purified and
characterized by restriction enzyme mapping and DNA sequence
analysis to identify those that contain sequence corresponding
to the 3' end of the β4 subunit coding sequence and a
termination codon. Selected clones are ligated to the clone
containing the 5' half of the β4 coding sequence at convenient
restriction sites to generate a full-length cDNA encoding a β4
subunit. The sequence of a full-length β4 clone is set forth
in SEQ ID No. 27; the amino acid sequence is set forth in SEQ
ID No. 28.

B. Isolation of cDNA Clones Encoding a Human β3
Subunit

Sequencing of clones isolated as described in Example III
also revealed a clone encoding a human neuronal calcium
channel β3 subunit. This clone has been deposited as plasmid
β1.42 (ATCC Accession No. 69048).

To isolate a full-length cDNA clone encoding a complete
β3 subunit, a human hippocampus cDNA library (Stratagene, La
Jolla, CA) was screened for hybridization to a 5' EcoRI-PstI
fragment of the cDNA encoding β1-2 using lower stringency
hybridization conditions (20% deionized formamide, 200 μg/ml
sonicated herring sperm DNA, 5X SSPE, 5X Denhardt's solution,
42°C) and wash conditions. One of the hybridizing clones
contained both translation initiation and termination codons
and encodes a complete β3 subunit designated β3-1 (Sequence ID
No. 19). In vitro transcripts of the cDNA were prepared and
injected into Xenopus oocytes along with transcripts of the
α1B-1 and α2b cDNAs using methods similar to those described in
Example IX.D. Two-electrode voltage clamp recordings of the
oocytes revealed significant voltage-dependent inward Ba2+
currents.

An additional β3 subunit-encoding clone, designated β3-2,
was obtained by screening a human cerebellum cDNA library for
hybridization to the nucleic acid amplification product
referred to in Example VIII.A. under lower stringency (20%
deionized formamide, 200 μg/ml sonicated herring sperm DNA, 5X
SSPE, 5X Denhardt's solution, 42°C) hybridization and wash
conditions. This clone (Sequence ID No. 20, β3-2) and the
first β3 subunit clone, designated β3-1 (Sequence ID No. 19),
differ at their 5' ends and are splice variants of the β3
gene.

EXAMPLE IX: RECOMBINANT EXPRESSION OF HUMAN NEURONAL

CALCIUM CHANNEL SUBUNIT-ENCODING cDNA AND RNA
TRANSCRIPTS IN MAMMALIAN CELLS

A. Recombinant Expression of the Human

Neuronal Calcium Channel α2 subunit cDNA
in DG44 Cells

1. Stable transfection of DG44 cells

DG44 cells [dhfr- Chinese hamster ovary cells; see, e.g.,
Urlaub, G. et al. (1986) Som. Cell Molec. Genet. 12:555-566]
obtained from Lawrence Chasin at Columbia University were
stably transfected by CaPO4 precipitation methods [Wigler et
al. (1979) Proc. Natl. Acad. Sci. USA 76:1373-1376] with
pSV2dhfr vector containing the human neuronal calcium channel
α2-subunit cDNA (see Example IV) for polycistronic
expression/selection in transfected cells. Transfectants were
grown on 10% DMEM medium without hypoxanthine or thymidine in
order to select cells that had incorporated the expression
vector. Twelve transfectant cell lines were established as
indicated by their ability to survive on this medium.

2. Analysis of α2 subunit cDNA expression in
transfected DG44 cells

Total RNA was extracted according to the method of
Birnboim [(1988) Nuc. Acids Res. 16:1487-1497] from four of
the DG44 cell lines that had been stably transfected with
pSV2dhfr containing the human neuronal calcium channel α2
subunit cDNA. RNA (~15 μg per lane) was separated on a 1%
agarose formaldehyde gel, transferred to nitrocellulose and
hybridized to the random-primed human neuronal calcium channel
α2 cDNA (hybridization: 50% formamide, 5x SSPE, 5x
Denhardt's, 42°C; wash: 0.2x SSPE, 0.1% SDS, 65°C).
Northern blot analysis of total RNA from four of the DG44 cell
lines that had been stably transfected with pSV2dhfr
containing the human neuronal calcium channel α2 subunit cDNA
revealed that one of the four cell lines contained hybridizing
mRNA of the size expected for the transcript of the α2 subunit
cDNA (5000 nt based on the size of the cDNA) when grown in the
presence of 10 mM sodium butyrate for two days. Butyrate
nonspecifically induces transcription and is often used for
inducing the SV40 early promoter [Gorman, C. and Howard, B.
(1983) Nucleic Acids Res. 11:1631]. This cell line, 44α2-9,
also produced mRNA species smaller (several species) and
larger (6800 nt) than the size expected for the transcript of
the α2 cDNA (5000 nt) that hybridized to the α2 cDNA-based
probe. The 5000- and 6800-nt transcripts produced by this
transfectant should contain the entire α2 subunit coding
sequence and therefore should yield a full-length α2 subunit
protein. A weakly hybridizing 8000-nucleotide transcript was
present in untransfected and transfected DG44 cells.
Apparently, DG44 cells transcribe a calcium channel α2 subunit
or similar gene at low levels. The level of expression of
this endogenous α2 subunit transcript did not appear to be
affected by exposing the cells to butyrate before isolation of
RNA for northern analysis.

Total protein was extracted from three of the DG44 cell
lines that had been stably transfected with pSV2dhfr
containing the human neuronal calcium channel α2 subunit cDNA.
Approximately 10^7 cells were sonicated in 300 μl of a solution
containing 50 mM HEPES, 1 mM EDTA, 1 mM PMSF. An equal volume
of 2x loading dye [Laemmli, U.K. (1970) Nature 227:680] was
added to the samples and the protein was subjected to
electrophoresis on an 8% polyacrylamide gel and then
electrotransferred to nitrocellulose. The nitrocellulose was
incubated with polyclonal guinea pig antisera (1:200 dilution)
directed against the rabbit skeletal muscle calcium channel α2
subunit (obtained from K. Campbell, University of Iowa)
followed by incubation with [125I]-protein A. The blot was
exposed to X-ray film at -70°C. Reduced samples of protein
from the transfected cells as well as from untransfected DG44
cells contained immunoreactive protein of the size expected
for the α2 subunit of the human neuronal calcium channel
(130-150 kDa). The level of this immunoreactive protein was higher
in 44α2-9 cells that had been grown in the presence of 10 mM
sodium butyrate than in 44α2-9 cells that were grown in the
absence of sodium butyrate. These data correlate well with
those obtained in northern analyses of total RNA from 44α2-9
and untransfected DG44 cells. Cell line 44α2-9 also produced
a 110 kD immunoreactive protein that may be either a product
of proteolytic degradation of the full-length α2 subunit or a
product of translation of one of the shorter (<5000 nt) mRNAs
produced in this cell line that hybridized to the α2 subunit
cDNA probe.

B. Expression of DNA encoding human
neuronal calcium channel α1, α2 and β1
subunits in HEK cells

Human embryonic kidney cells (HEK 293 cells) were
transiently and stably transfected with human neuronal DNA
encoding calcium channel subunits. Individual transfectants
were analyzed electrophysiologically for the presence of
voltage-activated barium currents and functional recombinant
voltage-dependent calcium channels.

1. Transfection of HEK 293 cells

Separate expression vectors containing DNA encoding human
neuronal calcium channel α1D, α2 and β1 subunits, plasmids
pVDCCIII(A), pHBCaCHα2A, and pHBCaCHβ1aRBS(A), respectively,
were constructed as described in Examples II.A.3, IV.B. and
III.B.3., respectively. These three vectors were used to
transiently co-transfect HEK 293 cells. For stable
transfection of HEK 293 cells, vector pHBCaCHβ1bRBS(A) (Example
III.B.3.) was used in place of pHBCaCHβ1aRBS(A) to introduce
the DNA encoding the β1 subunit into the cells along with
pVDCCIII(A) and pHBCaCHα2A.

a. Transient transfection

Expression vectors pVDCCIII(A), pHBCaCHα2A and
pHBCaCHβ1aRBS(A) were used in two sets of transient
transfections of HEK 293 cells (ATCC Accession No. CRL1573).
In one transfection procedure, HEK 293 cells were transiently
cotransfected with the α1 subunit cDNA expression plasmid, the
α2 subunit cDNA expression plasmid, the β1 subunit cDNA
expression plasmid and plasmid pCMVβgal (Clontech
Laboratories, Palo Alto, CA). Plasmid pCMVβgal contains the
lacZ gene (encoding E. coli β-galactosidase) fused to the
cytomegalovirus (CMV) promoter and was included in this
transfection as a marker gene for monitoring the efficiency of
transfection. In the other transfection procedure, HEK 293
cells were transiently co-transfected with the α1 subunit cDNA
expression plasmid pVDCCIII(A) and pCMVβgal. In both
transfections, 2-4 x 10^6 HEK 293 cells in a 10-cm tissue
culture plate were transiently co-transfected with 5 μg of
each of the plasmids included in the experiment according to
standard CaPO4 precipitation transfection procedures (Wigler
et al. (1979) Proc. Natl. Acad. Sci. USA 76:1373-1376). The
transfectants were analyzed for β-galactosidase expression by
direct staining of the product of a reaction involving
β-galactosidase and the X-gal substrate [Jones, J.R. (1986) EMBO
5:3133-3142] and by measurement of β-galactosidase activity
[Miller, J.H. (1972) Experiments in Molecular Genetics, pp.
352-355, Cold Spring Harbor Press]. To evaluate subunit cDNA
expression in these transfectants, the cells were analyzed for
subunit transcript production (northern analysis), subunit
protein production (immunoblot analysis of cell lysates) and
functional calcium channel expression (electrophysiological
analysis).

b. Stable transfection

HEK 293 cells were transfected using the calcium
phosphate transfection procedure [Current Protocols in
Molecular Biology, Vol. 1, Wiley Inter-Science, Supplement 14,
Unit 9.1.1-9.1.9 (1990)]. Ten-cm plates, each containing
one-to-two million HEK 293 cells, were transfected with 1 ml of
DNA/calcium phosphate precipitate containing 5 μg pVDCCIII(A),
5 μg pHBCaCHα2A, 5 μg pHBCaCHβ1bRBS(A), 5 μg pCMVβgal and 1 μg
pSV2neo (as a selectable marker). After 10-20 days of growth
in media containing 500 μg G418, colonies had formed and were
isolated using cloning cylinders.

2. Analysis of HEK 293 cells transiently
transfected with DNA encoding human neuronal
calcium channel subunits

a. Analysis of β-galactosidase expression

Transient transfectants were assayed for β-galactosidase
expression by β-galactosidase activity assays (Miller, J.H.
(1972) Experiments in Molecular Genetics, pp. 352-355, Cold
Spring Harbor Press) of cell lysates (prepared as described in
Example VII.A.2) and staining of fixed cells (Jones, J.R.
(1986) EMBO 5:3133-3142). The results of these assays
indicated that approximately 30% of the HEK 293 cells had been
transfected.
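
A transfection efficiency of about 30% has a practical consequence for single-cell assays: many of the cells chosen for recording will not carry the foreign DNA. The sketch below illustrates this with a simple binomial model. It assumes that each sampled cell is independently transfected with probability 0.30, which is a simplification introduced here and not a statement from the source. It does not account for cells that take up DNA without expressing all subunits. The function name `prob_at_most` is hypothetical.

```python
from math import comb


def prob_at_most(k, n, p):
    """Probability of observing at most k positive cells among n sampled,
    when each cell is positive independently with probability p."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


efficiency = 0.30   # approximate fraction of transfected cells (from the assays)
sampled = 9         # illustrative number of cells examined one at a time

p_none = prob_at_most(0, sampled, efficiency)
p_two_or_fewer = prob_at_most(2, sampled, efficiency)

print(f"P(no transfected cell among {sampled}) = {p_none:.3f}")
print(f"P(two or fewer transfected cells among {sampled}) = {p_two_or_fewer:.3f}")
```

Under these assumptions, finding only a few responsive cells in a small sample is not unusual, so a low hit rate in single-cell recordings does not by itself indicate a failed transfection.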

b. Northern analysis

PolyA+ RNA was isolated using the Invitrogen Fast Trak
Kit (Invitrogen, San Diego, CA) from HEK 293 cells transiently
transfected with DNA encoding each of the α1, α2 and β1
subunits and the lacZ gene or the α1 subunit and the lacZ
gene. The RNA was subjected to electrophoresis on an agarose
gel and transferred to nitrocellulose. The nitrocellulose was
then hybridized with one or more of the following radiolabeled
probes: the lacZ gene, human neuronal calcium channel α1
subunit-encoding cDNA, human neuronal calcium channel α2
subunit-encoding cDNA or human neuronal calcium channel β1
subunit-encoding cDNA. Two transcripts that hybridized with
the α1 subunit-encoding cDNA were detected in HEK 293 cells
transfected with the DNA encoding the α1, α2, and β1 subunits
and the lacZ gene as well as in HEK 293 cells transfected with
the α1 subunit cDNA and the lacZ gene. One mRNA species was
the size expected for the transcript of the α1 subunit cDNA
(8000 nucleotides). The second RNA species was smaller (4000
nucleotides) than the size expected for this transcript. RNA
of the size expected for the transcript of the lacZ gene was
detected in cells transfected with the α1, α2 and β1
subunit-encoding cDNA and the lacZ gene and in cells transfected
with the α1 subunit cDNA and the lacZ gene by hybridization to
the lacZ gene sequence.

RNA from cells transfected with the α1, α2 and β1
subunit-encoding cDNA and the lacZ gene was also hybridized with the
α2 and β1 subunit cDNA probes. Two mRNA species hybridized to
the α2 subunit cDNA probe. One species was the size expected
for the transcript of the α2 subunit cDNA (4000 nucleotides).
The other species was larger (6000 nucleotides) than the
expected size of this transcript. Multiple RNA species in the
cells co-transfected with α1, α2 and β1 subunit-encoding cDNA
and the lacZ gene hybridized to the β1 subunit cDNA probe.
Multiple β subunit transcripts of varying sizes were produced
since the β subunit cDNA expression vector contains two
potential polyA+ addition sites.

c. Electrophysiological analysis

Individual transiently transfected HEK 293 cells were
assayed for the presence of voltage-dependent barium currents
using the whole-cell variant of the patch clamp technique
[Hamill et al. (1981) Pflugers Arch. 391:85-100]. HEK 293
cells transiently transfected with pCMVβgal only were assayed
for barium currents as a negative control in these
experiments. The cells were placed in a bathing solution that
contained barium ions to serve as the current carrier.
Choline chloride, instead of NaCl or KCl, was used as the
major salt component of the bath solution to eliminate
currents through sodium and potassium channels. The bathing
solution contained 1 mM MgCl2 and was buffered at pH 7.3 with
10 mM HEPES (pH adjusted with sodium or tetraethylammonium
hydroxide). Patch pipettes were filled with a solution
containing 135 mM CsCl, 1 mM MgCl2, 10 mM glucose, 10 mM EGTA,
4 mM ATP and 10 mM HEPES (pH adjusted to 7.3 with
tetraethylammonium hydroxide). Cesium and tetraethylammonium
ions block most types of potassium channels. Pipettes were
coated with Sylgard (Dow-Corning, Midland, MI) and had
resistances of 1-4 megohm. Currents were measured through a
500 megohm headstage resistor with the Axopatch 1C (Axon
Instruments, Foster City, CA) amplifier, interfaced with a
Labmaster (Scientific Solutions, Solon, OH) data acquisition
board in an IBM-compatible PC. pClamp (Axon Instruments) was
used to generate voltage commands and acquire data. Data were
analyzed with pClamp or Quattro Professional (Borland
International, Scotts Valley, CA) programs.

To apply drugs, "puffer" pipettes positioned within
several micrometers of the cell under study were used to apply
solutions by pressure application. The drugs used for
pharmacological characterization were dissolved in a solution
identical to the bathing solution. Samples of a 10 mM stock
solution of Bay K 8644 (RBI, Natick, MA), which was prepared
in DMSO, were diluted to a final concentration of 1 μM in 15
mM Ba2+-containing bath solution before they were applied.

Twenty-one negative control HEK 293 cells (transiently
transfected with the lacZ gene expression vector pCMVβgal
only) were analyzed by the whole-cell variant of the patch
clamp method for recording currents. Only one cell displayed
a discernible inward barium current; this current was not
affected by the presence of 1 μM Bay K 8644. In addition,
application of Bay K 8644 to four cells that did not display
Ba2+ currents did not result in the appearance of any currents.

Two days after transient transfection of HEK 293 cells
with α1, α2 and β1 subunit-encoding cDNA and the lacZ gene,
individual transfectants were assayed for voltage-dependent
barium currents. The currents in nine transfectants were
recorded. Because the efficiency of transfection of one cell
can vary from the efficiency of transfection of another cell,
the degree of expression of heterologous proteins in
individual transfectants varies and some cells do not
incorporate or express the foreign DNA. Inward barium
currents were detected in two of these nine transfectants. In
these assays, the holding potential of the membrane was -90
mV. The membrane was depolarized in a series of voltage steps
to different test potentials and the current in the presence
and absence of 1 μM Bay K 8644 was recorded. The inward
barium current was significantly enhanced in magnitude by the
addition of Bay K 8644. The largest inward barium current
(-160 pA) was recorded when the membrane was depolarized to 0
mV in the presence of 1 μM Bay K 8644. A comparison of the
I-V curves, generated by plotting the largest current recorded
after each depolarization versus the depolarization voltage,
corresponding to recordings conducted in the absence and
presence of Bay K 8644 illustrated the enhancement of the
voltage-activated current in the presence of Bay K 8644.
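
The I-V curve construction described above, in which the largest current after each depolarization is plotted against the test potential, can be sketched as follows. The traces here are hypothetical values invented purely for illustration and are not data from these experiments. The function name `iv_curve` is an assumption, as is the sign convention that inward current is negative, so that the largest inward current is the minimum sample.

```python
def iv_curve(traces):
    """Return sorted (test potential in mV, peak inward current in pA) pairs.

    `traces` maps each test potential to the current samples recorded
    during that depolarizing step. Inward current is taken as negative,
    so the peak inward current is the minimum sample of each trace.
    """
    return sorted((v, min(samples)) for v, samples in traces.items())


# Hypothetical recordings, invented for illustration only.
control = {-30: [0.0, -4.0, -2.0], 0: [0.0, -30.0, -18.0], 30: [0.0, -12.0, -6.0]}
with_drug = {-30: [0.0, -15.0, -9.0], 0: [0.0, -70.0, -40.0], 30: [0.0, -20.0, -11.0]}

curve_control = iv_curve(control)
curve_drug = iv_curve(with_drug)

# Test potential at which the largest inward current occurs with drug present.
peak_v, peak_i = min(curve_drug, key=lambda point: point[1])

for (v, i_ctl), (_, i_drug) in zip(curve_control, curve_drug):
    print(f"{v:+4d} mV  control {i_ctl:7.1f} pA  with drug {i_drug:7.1f} pA")
print(f"largest inward current with drug: {peak_i} pA at {peak_v} mV")
```

Comparing the two lists point by point shows the kind of enhancement the text attributes to Bay K 8644, although real recordings would also require leak subtraction and other corrections not modeled here.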

Pronounced tail currents were detected in the tracings
of currents generated in the presence of Bay K 8644 in HEK 293
cells transfected with α1, α2 and β1 subunit-encoding cDNA and
the lacZ gene, indicating that the recombinant calcium
channels responsible for the voltage-activated barium currents
recorded in this transfectant appear to be DHP-sensitive.

The second of the two transfected cells that displayed
inward barium currents expressed a -50 pA current when the
membrane was depolarized from -90 mV. This current was nearly
completely blocked by 200 μM cadmium, an established calcium
channel blocker.

Ten cells that were transiently transfected with the DNA
encoding the α1 subunit and the lacZ gene were analyzed by
whole-cell patch clamp methods two days after transfection.
One of these cells displayed a 30 pA inward barium current.
This current amplified 2-fold in the presence of 1 μM Bay K
8644. Furthermore, small tail currents were detected in the
presence of Bay K 8644. These data indicate that expression
of the human neuronal calcium channel α1D subunit-encoding cDNA
in HEK 293 cells yields a functional DHP-sensitive calcium channel.

3. Analysis of HEK 293 cells stably transfected 
with DNA encoding human neuronal calcium 
channel subunits 

Individual stably transfected HEK 293 cells were assayed 
electrophysiologically for the presence of voltage-dependent 
barium currents as described for electrophysiological analysis 
of transiently transfected HEK 293 cells (see Example 
VII.B.2.c). In an effort to maximize calcium channel activity 
via cyclic-AMP-dependent kinase-mediated phosphorylation 
[Pelzer, et al. (1990) Rev. Physiol. Biochem. Pharmacol. 
114:107-207], cAMP (Na salt, 250 μM) was added to the pipet 
solution and forskolin (10 μM) was added to the bath solution 
in some of the recordings. Qualitatively similar results were 
obtained whether these compounds were present or not. 

Barium currents were recorded from stably transfected 
cells in the absence and presence of Bay K 8644 (1 μM). When 
the cell was depolarized to -10 mV from a holding potential of 
-90 mV in the absence of Bay K 8644, a current of 
approximately 35 pA with a rapidly deactivating tail current 
was recorded. During application of Bay K 8644, an identical 
depolarizing protocol elicited a current of approximately 75 
pA, accompanied by an augmented and prolonged tail current. 
The peak magnitude of currents recorded from this same cell as 
a function of a series of depolarizing voltages was assessed. 
The responses in the presence of Bay K 8644 not only 
increased, but the entire current-voltage relation shifted 
about -10 mV. Thus, three typical hallmarks of Bay K 8644 
action, namely increased current magnitude, prolonged tail 
currents, and negatively shifted activation voltage, were 
observed, clearly indicating the expression of a DHP-sensitive 
calcium channel in these stably transfected cells. No such 
effects of Bay K 8644 were observed in untransfected HEK 293 
cells, either with or without cAMP or forskolin. 

C. Use of pCMV-based vectors and pcDNA1-based vectors 
for expression of DNA encoding human neuronal 
calcium channel subunits 

1. Preparation of constructs 

Additional expression vectors were constructed using 
pCMV. The full-length α1D cDNA from pVDCCIII(A) (see Example 
II.A.3.d), the full-length α2 cDNA, contained on a 3600 bp 
EcoRI fragment from HBCaCHα2 (see Example IV.B) and a 
full-length β1 subunit cDNA from pHBCaCHβ1bRBS(A) (see Example 
III.B.3) were separately subcloned into plasmid pCMVβgal. 
Plasmid pCMVβgal was digested with NotI to remove the lacZ 
gene. The remaining vector portion of the plasmid, referred 
to as pCMV, was blunt-ended at the NotI sites. The 
full-length α2-encoding DNA and β1-encoding DNA, contained on 
separate EcoRI fragments, were isolated, blunt-ended and 
separately ligated to the blunt-ended vector fragment of pCMV, 
locating the cDNAs between the CMV promoter and SV40 
polyadenylation sites in pCMV. To ligate the α1D-encoding cDNA 
with pCMV, the restriction sites in the polylinkers 
immediately 5' of the CMV promoter and immediately 3' of the 
SV40 polyadenylation site were removed from pCMV. A 
polylinker was added at the NotI site. The polylinker had the 
following sequence of restriction enzyme recognition sites: 

The α1D-encoding DNA, isolated as a BamHI/XhoI fragment from 
pVDCCIII(A), was then ligated to XbaII/SalI-digested pCMV to 
place it between the CMV promoter and SV40 polyadenylation 
site. 

Plasmid pCMV contains the CMV promoter as does pcDNA1, 
but differs from pcDNA1 in the location of splice donor/splice 
acceptor sites relative to the inserted subunit-encoding DNA. 
After inserting the subunit-encoding DNA into pCMV, the splice 
donor/splice acceptor sites are located 3' of the CMV promoter 
and 5' of the subunit-encoding DNA start codon. After 
inserting the subunit-encoding DNA into pcDNA1, the splice 
donor/splice acceptor sites are located 3' of the subunit cDNA 
stop codon. 

2. Transfection of HEK 293 cells 

HEK 293 cells were transiently co-transfected with the 
α1D, α2 and β1 subunit-encoding DNA in pCMV or with the α1D, α2 
and β1 subunit-encoding DNA in pcDNA1 (vectors pVDCCIII(A), 
pHBCaCHα2A and pHBCaCHβ1bRBS(A), respectively), as described in 
Example VII.B.1.a. Plasmid pCMVβgal was included in each 
transfection as a measure of transfection efficiency. The 
results of β-galactosidase assays of the transfectants (see 
Example VII.B.2.) indicated that HEK 293 cells were 
transfected equally efficiently with pCMV- and pcDNA1-based 
plasmids. The pcDNA1-based plasmids, however, are presently 
preferred for expression of calcium channel receptors. 

D. Expression in Xenopus laevis oocytes of RNA 
encoding human neuronal calcium channel subunits 

Various combinations of the transcripts of DNA encoding 
the human neuronal α1D, α2 and β1 subunits prepared in vitro 
were injected into Xenopus laevis oocytes. Those injected 
with combinations that included α1D exhibited voltage-activated 
barium currents. 

1. Preparation of transcripts 

Transcripts encoding the human neuronal calcium channel 
α1D, α2 and β1 subunits were synthesized according to the 
instructions of the mCAP mRNA CAPPING KIT (Stratagene, La 
Jolla, CA catalog #200350). Plasmids pVDCCIII.RBS(A), 
containing pcDNA1 and the α1D cDNA that begins with a ribosome 
binding site and the eighth ATG codon of the coding sequence 
(see Example III.A.3.d), plasmid pHBCaCHα2A containing pcDNA1 
and an α2 subunit cDNA (see Example IV), and plasmid 
pHBCaCHβ1bRBS(A) containing pcDNA1 and the β1 DNA lacking 
intron sequence and containing a ribosome binding site (see 
Example III), were linearized by restriction digestion. The 
α1D cDNA- and α2 subunit-encoding plasmids were digested with 
XhoI, and the β1 subunit-encoding plasmid was digested with 
EcoRV. The DNA insert was transcribed with T7 RNA polymerase. 

2. Injection of oocytes 

Xenopus laevis oocytes were isolated and defolliculated 
by collagenase treatment and maintained in 100 mM NaCl, 2 mM 
KCl, 1.8 mM CaCl2, 1 mM MgCl2, 5 mM HEPES, pH 7.6, 20 μg/ml 
ampicillin and 25 μg/ml streptomycin at 19-25°C for 2 to 5 
days after injection and prior to recording. For each 
transcript that was injected into the oocyte, 6 ng of the 
specific mRNA was injected per cell in a total volume of 50 
nl. 
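
The injection amounts stated above imply a simple relation between the number of co-injected transcripts and the RNA concentration of the injection mix. The sketch below only restates that arithmetic; the function name `injected_rna` is hypothetical, and it assumes the 50 nl volume is the same regardless of how many transcripts are combined.

```python
def injected_rna(n_transcripts, ng_per_transcript=6.0, volume_nl=50.0):
    """Return (total RNA in ng, concentration in ng/nl) per oocyte.

    1 ng/nl is the same concentration as 1 ug/ul.
    """
    total_ng = n_transcripts * ng_per_transcript
    return total_ng, total_ng / volume_nl


for n in (1, 2, 3):
    total, conc = injected_rna(n)
    print(f"{n} transcript(s): {total:.0f} ng total, {conc:.2f} ng/nl")
```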

3. Intracellular voltage recordings 

Injected oocytes were examined for voltage-dependent 
barium currents using two-electrode voltage clamp methods 
[Dascal, N. (1987) CRC Crit. Rev. Biochem. 22:317]. The 
pClamp (Axon Instruments) software package was used in 
conjunction with a Labmaster 125 kHz data acquisition 
interface to generate voltage commands and to acquire and 
analyze data. Quattro Professional was also used in this 
analysis. Current signals were digitized at 1-5 kHz, and 
filtered appropriately. The bath solution consisted of the 
following: 40 mM BaCl2, 36 mM tetraethylammonium chloride 
(TEA-Cl), 2 mM KCl, 5 mM 4-aminopyridine, 0.15 mM niflumic 
acid, 5 mM HEPES, pH 7.6. 

a. Electrophysiological analysis of oocytes 
injected with transcripts encoding the 
human neuronal calcium channel α1D, α2 and 
β1 subunits 

Uninjected oocytes were examined by two-electrode voltage 
clamp methods and a very small (25 nA) endogenous inward Ba2+ 
current was detected in only one of seven analyzed cells. 

Oocytes coinjected with α1D, α2 and β1 subunit transcripts 
expressed sustained inward barium currents upon depolarization 
of the membrane from a holding potential of -90 mV or -50 mV 
(154 ± 129 nA, n=21). These currents typically showed little 
inactivation when test pulses ranging from 140 to 700 msec 
were administered. Depolarization to a series of voltages 
revealed currents that first appeared at approximately -30 mV 
and peaked at approximately 0 mV. 

Application of the DHP Bay K 8644 increased the magnitude 
of the currents, prolonged the tail currents present upon 
repolarization of the cell and induced a hyperpolarizing shift 
in current activation. Bay K 8644 was prepared fresh from a 
stock solution in DMSO and introduced as a 10x concentrate 
directly into the 60 μl bath while the perfusion pump was 
turned off. The DMSO concentration of the final diluted drug 
solutions in contact with the cell never exceeded 0.1%. 
Control experiments showed that 0.1% DMSO had no effect on 
membrane currents. 

Application of the DHP antagonist nifedipine (stock 
solution prepared in DMSO and applied to the cell as described 
for application of Bay K 8644) blocked a substantial fraction 
(91 ± 6%, n=7) of the inward barium current in oocytes 
coinjected with transcripts of the α1D, α2 and β1 subunits. A 
residual inactivating component of the inward barium current 
typically remained after nifedipine application. The inward 
barium current was blocked completely by 50 μM Cd2+, but only 
approximately 15% by 100 μM Ni2+. 

The effect of ω-CgTx on the inward barium currents in 
oocytes co-injected with transcripts of the α1D, α2, and β1 
subunits was investigated. ω-CgTx (Bachem, Inc., Torrance CA) 
was prepared in the 15 mM BaCl2 bath solution plus 0.1% 
cytochrome C (Sigma) to serve as a carrier protein. Control 
experiments showed that cytochrome C had no effect on 
currents. A series of voltage pulses from a -90 mV holding 
potential to 0 mV was recorded at 20 msec intervals. To 
reduce the inhibition of ω-CgTx binding by divalent cations, 
recordings were made in 15 mM BaCl2, 73.5 mM 
tetraethylammonium chloride, and the remaining ingredients 
identical to the 40 mM Ba2+ recording solution. Bay K 8644 was 
applied to the cell prior to addition of ω-CgTx in order to 
determine the effect of ω-CgTx on the DHP-sensitive current 
component that was distinguished by the prolonged tail 
currents. The inward barium current was blocked weakly (54 ± 
29%, n=7) and reversibly by relatively high concentrations 
(10-15 μM) of ω-CgTx. The test currents and the accompanying 
tail currents were blocked progressively within two to three 
minutes after application of ω-CgTx, but both recovered 
partially as the ω-CgTx was flushed from the bath. 

b. Analysis of oocytes injected with only 
transcripts encoding the human 
neuronal calcium channel α1D or 
transcripts encoding an α1D and other 
subunits 

The contribution of the α2 and β1 subunits to the inward 
barium current in oocytes injected with transcripts encoding 
the α1D, α2 and β1 subunits was assessed by expression of the 
α1D subunit alone or in combination with either the β1 subunit 
or the α2 subunit. In oocytes injected with only the 
transcript of an α1D cDNA, no Ba2+ currents were detected (n=3). 
In oocytes injected with transcripts of α1D and β1 cDNAs, small 
(108 ± 39 nA) Ba2+ currents were detected upon depolarization 
of the membrane from a holding potential of -90 mV that 
resembled the currents observed in cells injected with 
transcripts of α1D, α2 and β1 cDNAs, although the magnitude of 
the current was less. In two of the four oocytes injected 
with transcripts of the α1D-encoding and β1-encoding DNA, the 
Ba2+ currents exhibited a sensitivity to Bay K 8644 that was 
similar to the Bay K 8644 sensitivity of Ba2+ currents 
expressed in oocytes injected with transcripts encoding the α1D, 
α2 and β1 subunits. 

Three of five oocytes injected with transcripts encoding 
the α1D and α2 subunits exhibited very small Ba2+ currents (15- 
30 nA) upon depolarization of the membrane from a holding 
potential of -90 mV. These barium currents showed little or 
no response to Bay K 8644. 

c. Analysis of oocytes injected with 
transcripts encoding the human neuronal 
calcium channel α2 and/or β1 subunit 

To evaluate the contribution of the α1D subunit to the 
inward barium currents detected in oocytes co-injected with 
transcripts encoding the α1D, α2 and β1 subunits, oocytes 
injected with transcripts encoding the human neuronal calcium 
channel α2 and/or β1 subunits were assayed for barium currents. 
Oocytes injected with transcripts encoding the α2 subunit 
displayed no detectable inward barium currents (n=5). 
Oocytes injected with transcripts encoding a β1 subunit 
displayed measurable (54 ± 23 nA, n=5) inward barium currents 
upon depolarization and oocytes injected with transcripts 
encoding the α2 and β1 subunits displayed inward barium 
currents that were approximately 50% larger (80 ± 61 nA, n=18) 
than those detected in oocytes injected with transcripts of 
the β1-encoding DNA only. 

The inward barium currents in oocytes injected with 
transcripts encoding the β1 subunit or α2 and β1 subunits 
typically were first observed when the membrane was 
depolarized to -30 mV from a holding potential of -90 mV and 
peaked when the membrane was depolarized to 10 to 20 mV. 
Macroscopically, the currents in oocytes injected with 
transcripts encoding the α2 and β1 subunits or with transcripts 
encoding the β1 subunit were indistinguishable. In contrast 
to the currents in oocytes co-injected with transcripts of α1D, 
α2 and β1 subunit cDNAs, these currents showed a significant 
inactivation during the test pulse and a strong sensitivity to 
the holding potential. The inward barium currents in oocytes 
co-injected with transcripts encoding the α2 and β1 subunits 
usually inactivated to 10-60% of the peak magnitude during a 
140-msec pulse and were significantly more sensitive to 
holding potential than those in oocytes co-injected with 
transcripts encoding the α1D, α2 and β1 subunits. Changing the 
holding potential of the membranes of oocytes co-injected with 
transcripts encoding the α2 and β1 subunits from -90 to -50 mV 
resulted in an approximately 81% (n=11) reduction in the 
magnitude of the inward barium current of these cells. In 
contrast, the inward barium current measured in oocytes 
co-injected with transcripts encoding the α1D, α2 and β1 subunits 
was reduced approximately 24% (n=11) when the holding 
potential was changed from -90 to -50 mV. 

The inward barium currents detected in oocytes injected 
with transcripts encoding the α2 and β1 subunits were 
pharmacologically distinct from those observed in oocytes 
co-injected with transcripts encoding the α1D, α2 and β1 subunits. 
Oocytes injected with transcripts encoding the α2 and β1 
subunits displayed inward barium currents that were 
insensitive to Bay K 8644 (n=11). Nifedipine sensitivity was 
difficult to measure because of the holding potential 
sensitivity of nifedipine and the current observed in oocytes 
injected with transcripts encoding the α2 and β1 subunits. 
Nevertheless, two oocytes that were co-injected with 
transcripts encoding the α2 and β1 subunits displayed 
measurable (25 to 45 nA) inward barium currents when 
depolarized from a holding potential of -50 mV. These 
currents were insensitive to nifedipine (5 to 10 μM). The 
inward barium currents in oocytes injected with transcripts 
encoding the α2 and β1 subunits showed the same sensitivity to 
heavy metals as the currents detected in oocytes injected with 
transcripts encoding the α1D, α2 and β1 subunits. 

The inward barium current detected in oocytes injected 
with transcripts encoding the human neuronal α2 and β1 subunits 
has pharmacological and biophysical properties that resemble 
calcium currents in uninjected Xenopus oocytes. Because the 
amino acids of this human neuronal calcium channel β1 subunit 
lack hydrophobic segments capable of forming transmembrane 
domains, it is unlikely that recombinant β1 subunits alone can 
form an ion channel. It is more probable that a homologous 
endogenous α1 subunit exists in oocytes and that the activity 
mediated by such an α1 subunit is enhanced by expression of a 
human neuronal β1 subunit. 

E. Expression of DNA encoding human neuronal calcium 
channel α1B-1, α2b and β1-2 subunits in HEK cells 

1. Transfection of HEK cells 

The transient expression of the human neuronal α1B-1, α2b 
and β1-2 subunits was studied in HEK293 cells. The HEK293 
cells were grown as a monolayer culture in Dulbecco's modified 
Eagle's medium (Gibco) containing 5% defined-supplemented 
bovine calf serum (Hyclone) plus penicillin G (100 U/ml) and 
streptomycin sulfate (100 μg/ml). HEK293 cell transfections 
were mediated by calcium phosphate as described above. 
Transfected cells were examined for inward Ba2+ currents (IBa) 
mediated by voltage-dependent Ca2+ channels. 

Cells were transfected (2 x 10^6 per polylysine-coated 
plate). Standard transfections (10-cm dish) contained 8 μg 
of pcDNAα1B-1, 5 μg of pHBCaCHα2A, 2 μg of pHBCaCHβ1bRBS(A) (see 
Examples II.A.3, IV.B. and III) and 2 μg of CMVβ (Clontech) 
β-galactosidase expression plasmid, and pUC18 to maintain a 
constant mass of 20 μg/ml. Cells were analyzed 48 to 72 hours 
after transfection. Transfection efficiencies (±10%), which 
were determined by in situ histochemical staining for 
β-galactosidase activity (Sanes et al. (1986) EMBO J., 
5:3133), generally were greater than 50%. 
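
The role of the pUC18 filler DNA can be illustrated with a short calculation. This is a sketch under stated assumptions: the helper `filler_mass` and the dictionary keys are hypothetical, and the 20 μg figure is treated here as the total DNA mass of one standard transfection.

```python
def filler_mass(plasmid_masses_ug, target_total_ug=20.0):
    """Mass of carrier DNA (ug) needed to reach a fixed total DNA mass."""
    used = sum(plasmid_masses_ug.values())
    if used > target_total_ug:
        raise ValueError("plasmid masses already exceed the target total")
    return target_total_ug - used


# Masses from the standard transfection described above (ug).
standard = {"alpha1B-1": 8.0, "alpha2": 5.0, "beta1": 2.0, "beta-gal": 2.0}
print(filler_mass(standard), "ug of pUC18")
```

Holding the total DNA mass constant in this way keeps the calcium phosphate precipitation conditions comparable between transfections that differ in their expression plasmid content.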

2. Electrophysiological analysis of transfectant 
currents 

a. Materials and methods 

Properties of recombinantly expressed Ca2+ channels were 
studied by whole cell patch-clamp techniques. Recordings were 
performed on transfected HEK293 cells 2 to 3 days after 
transfection. Cells were plated at 100,000 to 300,000 cells 
per polylysine-coated, 35-mm tissue culture dish (Falcon, 
Oxnard, CA) 24 hours before recordings. Cells were perfused 
with 15 mM BaCl2, 125 mM choline chloride, 1 mM MgCl2, and 
10 mM Hepes (pH = 7.3) adjusted with tetraethylammonium 
hydroxide (bath solution). Pipettes were filled with 135 mM 
CsCl, 10 mM EGTA, 10 mM Hepes, 4 mM Mg-adenosine triphosphate 
(pH = 7.5) adjusted with tetraethylammonium hydroxide. 
Sylgard (Dow-Corning, Midland, MI)-coated, fire-polished, and 
filled pipettes had resistances of 1 to 2 megohm before gigohm 
seals were established to cells. 

Bay K 8644 and nifedipine (Research Biochemicals, Natick, 
MA) were prepared from stock solutions (in dimethyl sulfoxide) 
and diluted into the bath solution. The dimethyl sulfoxide 
concentration in the final drug solutions in contact with the 
cells never exceeded 0.1%. Control experiments showed that 
0.1% dimethyl sulfoxide had no effect on membrane currents. 
ω-CgTx (Bachem, Inc., Torrance CA) was prepared in the 15 mM 
BaCl2 bath solution plus 0.1% cytochrome C (Sigma, St. Louis 
MO) to serve as a carrier protein. Control experiments 
showed that cytochrome C had no effect on currents. These 
drugs were dissolved in bath solution, and continuously 
applied by means of puffer pipettes as required for a given 
experiment. Recordings were performed at room temperature 
(22° to 25°C). Series resistance compensation (70 to 85%) was 
employed to minimize voltage error that resulted from pipette 
access resistance, typically 2 to 3.5 megohm. Current signals 
were filtered (-3 dB, 4-pole Bessel) at a frequency of 1/4 to 
1/5 the sampling rate, which ranged from 0.5 to 3 kHz. 
Voltage commands were generated and data were acquired with 
CLAMPEX (pClamp, Axon Instruments, Foster City, CA). All 
reported data are corrected for linear leak and capacitive 
components. Exponential fitting of currents was performed 
with CLAMPFIT (Axon Instruments, Foster City, CA). 
b. Results 

Transfectants were examined for inward Ba2+ current 
(IBa). Cells cotransfected with DNA encoding α1B-1, α2b, and β1-2 
subunits expressed high-voltage-activated Ca2+ channels. IBa 
first appeared when the membrane was depolarized from a 
holding potential of -90 mV to -20 mV and peaked in magnitude 
at 10 mV. Thirty-nine of 95 cells (12 independent 
transfections) had IBa that ranged from 30 to 2700 pA, with a 
mean of 433 pA. The mean current density was 26 pA/pF, and 
the highest density was 150 pA/pF. The IBa typically increased 
by 2- to 20-fold during the first 5 minutes of recording. 
Repeated depolarizations during long records often revealed 
rundown of IBa usually not exceeding 20% within 10 min. IBa 
typically activated within 10 ms and inactivated with both a 
fast time constant ranging from 46 to 105 ms and a slow time 
constant ranging from 291 to 453 ms (n = 3). Inactivation 
showed a complex voltage dependence, such that IBa elicited at 
+20 mV inactivated more slowly than IBa elicited at lower test 
voltages, possibly a result of an increase in the magnitude of 
slow compared to fast inactivation components at higher test 
voltages. 
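
The suggestion that a larger slow component makes inactivation appear slower can be illustrated with a two-exponential decay. This is a hedged sketch, not the fitting procedure used for the recordings: the function names are hypothetical, the time constants are simply picked from within the ranges quoted above, and the relative amplitudes are assumed values.

```python
import math


def biexp_current(t_ms, a_fast, tau_fast_ms, a_slow, tau_slow_ms):
    """Sum of a fast and a slow exponentially decaying component."""
    return (a_fast * math.exp(-t_ms / tau_fast_ms)
            + a_slow * math.exp(-t_ms / tau_slow_ms))


def fraction_remaining(t_ms, slow_fraction, tau_fast_ms=75.0, tau_slow_ms=370.0):
    """Current remaining at t_ms, normalized to 1.0 at t = 0.

    slow_fraction is the assumed share of the peak current carried by the
    slowly inactivating component.
    """
    return biexp_current(t_ms, 1.0 - slow_fraction, tau_fast_ms,
                         slow_fraction, tau_slow_ms)


for slow_fraction in (0.3, 0.7):
    print(slow_fraction, round(fraction_remaining(200.0, slow_fraction), 3))
```

With identical time constants, the trace with the larger slow fraction retains more current at any given time after the peak, which is the behavior the text attributes to higher test voltages.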

Recombinant α1B-1α2bβ1-2 channels were sensitive to holding 
potential. Steady-state inactivation of IBa, measured after 
a 30- to 60-s conditioning at various holding potentials, was 
approximately 50% at holding potential between -60 and -70 mV 
and approximately 90% at -40 mV. Recovery of IBa from 
inactivation was usually incomplete, measuring 55 to 75% of 
the original magnitude within 1 min. after the holding 
potential was returned to more negative potentials, possibly 
indicating some rundown or a slow recovery rate. 

Recombinant α1B-1α2bβ1-2 channels were also blocked 
irreversibly by ω-CgTx concentrations ranging from 0.5 to 
10 μM during the time scale of the experiments. Application 
of 5 μM toxin (n = 7) blocked the activity completely within 
2 min., and no recovery of IBa was observed after washing 
ω-CgTx from the bath for up to 15 min. Cd2+ blockage (50 μM) 
was rapid, complete, and reversible; the DHPs Bay K 8644 
(1 μM; n = 4) or nifedipine (5 μM; n = 3) had no discernible 
effect. 

Cells cotransfected with DNA encoding α1B-1, α2b, and β1-2 
subunits predominantly displayed a single class of saturable, 
high-affinity ω-CgTx binding sites. The determined 
dissociation constant (Kd) value was 54.6 ± 14.5 pM (n = 4). 
Cells transfected with the vector containing only 
β-galactosidase-encoding DNA or α2bβ-encoding DNA showed no 
specific binding. The binding capacity (Bmax) of the 
α1B-1α2bβ-transfected cells was 28,710 ± 11,950 sites per cell 
(n = 4). 

These results demonstrate that α1B-1α2bβ1-2-transfected 
cells express high-voltage-activated, inactivating Ca2+ channel 
activity that is irreversibly blocked by ω-CgTx, insensitive 
to DHPs, and sensitive to holding potential. The activation 
and inactivation kinetics and voltage sensitivity of the 
channel formed in these cells are generally consistent with 
previous characterizations of neuronal N-type Ca2+ channels. 

F. Expression of DNA encoding human neuronal calcium 
channel α1B-1, α1B-2, α2b, β1-2 and β1-3 subunits in HEK 
cells 

Significant Ba2+ currents were not detected in 
untransfected HEK293 cells. Furthermore, untransfected HEK293 
cells do not express detectable ω-CgTx GVIA binding sites. 

In order to approximate the expression of a homogeneous 
population of trimeric α1B, α2b and β1 protein complexes in 
transfected HEK293 cells, the α1B, α2b and β1 expression levels 
were altered. The efficiency of expression and assembly of 
channel complexes at the cell surface was optimized by 
adjusting the molar ratio of α1B, α2b and β1 expression plasmids 
used in the transfections. The transfectants were analyzed 
for mRNA levels, ω-CgTx GVIA binding and Ca2+ channel current 
density in order to determine near optimal channel expression 
in the absence of immunological reagents for evaluating 
protein expression. Higher molar ratios of α2b appeared to 
increase calcium channel activity. 

1. Transfections 

HEK293 cells were maintained in DMEM (Gibco #320-1965AJ), 
5.5% Defined/supplemented bovine calf serum (Hyclone 
#A-2151-D), 100 U/ml penicillin G and 100 μg/ml streptomycin. 
Ca2+-phosphate based transient transfections were performed and 
analyzed as described above. Cells were co-transfected with 
either 8 μg pcDNA1α1B-1 (described in Example II.C), 5 μg 
pHBCaCHα2A (see Example IV.B.), 2 μg pHBCaCHβ1bRBS(A) (β1-2 
expression plasmid; see Examples III.A. and IX.E.), and 2 μg 
pCMVβ-gal (Clontech, Palo Alto, CA) (2:1.8:1 molar ratio of 
Ca2+ channel subunit expression plasmids) or with 3 μg 
pcDNA1α1B-1 or pcDNA1α1B-2, 11.25 μg pHBCaCHα2A, 0.75 or 1.0 μg 
pHBCaCHβ1bRBS(A) or pcDNA1β1-3 and 2 μg pCMVβ-gal (2:10.9:1 
molar ratio of Ca2+ channel subunit expression plasmids). 
Plasmid pCMVβ-gal, a β-galactosidase expression plasmid, was 
included in the transfections as a marker to permit 
transfection efficiency estimates by histochemical staining. 
When fewer than three subunits were expressed, pCMVPL2, a pCMV 
promoter-containing vector that lacks a cDNA insert, was 
substituted to maintain equal moles of pCMV-based DNA in the 
transfection. pUC18 DNA was used to maintain the total mass 
of DNA in the transfection at 20 μg/plate. 

RNA from the transfected cells was analyzed by Northern 
blot analysis for calcium channel subunit mRNA expression 
using random primed 32P-labeled subunit specific probes. 
HEK293 cells co-transfected with α1B-1, α2b and β1-2 expression 
plasmids (8, 5 and 2 μg, respectively; molar ratio = 2:1.8:1) 
did not express equivalent levels of each Ca2+ channel subunit 
mRNA. Relatively high levels of α1B-1 and β1-2 mRNAs were 
expressed, but significantly lower levels of α2b mRNA were 
expressed. Based on autoradiograph exposures required to 
produce equivalent signals for all three mRNAs, α2b transcript 
levels were estimated to be 5 to 10 times lower than α1B-1 and 
β1-2 transcript levels. Untransfected HEK293 cells did not 
express detectable levels of α1B-1, α2b, or β1-2 mRNAs. 

To achieve equivalent Ca2+ channel subunit mRNA expression 
levels, a series of transfections was performed with various 
amounts of α1B-1, α2b and β1-2 expression plasmids. Because the 
α1B-1 and β1-2 mRNAs were expressed at very high levels compared 
to α2b mRNA, the mass of α1B-1 and β1-2 plasmids was lowered and 
the mass of α2b plasmid was increased in the transfection 
experiments. Co-transfection with 3, 11.25 and 0.75 μg of α1B-1, 
α2b and β1-2 expression plasmids, respectively (molar ratio 
= 2:10.9:1), approached equivalent expression levels of each 
Ca2+ channel subunit mRNA. The relative molar quantity of α2b 
expression plasmid to α1B-1 and β1-2 expression plasmids was 
increased 6-fold. The mass of α1B-1 and β1-2 plasmids in the 
transfection was decreased 2.67-fold and the mass of α2b 
plasmid was increased 2.25-fold. The 6-fold molar increase of 
α2b relative to α1B-1 and β1-2 required to achieve near equal 
abundance mRNA levels is consistent with the previous 5- to 
10-fold lower estimate of relative α2b mRNA abundance. ω-CgTx 
GVIA binding to cells transfected with various amounts of 
expression plasmids indicated that the 3, 11.25 and 0.75 μg of 
α1B-1, α2b and β1-2 plasmids, respectively, improved the level of 
cell surface expression of channel complexes. Further 
increases in the mass of α2b and β1-2 expression plasmids while 
α1B-1 was held constant, and alterations in the mass of the 
α1B-1 expression plasmid while α2b and β1-2 were held constant, 
indicated that the cell surface expression of ω-CgTx GVIA 
binding sites per cell was nearly optimal. All subsequent 
transfections were performed with 3, 11.25 and 0.75 or 1.0 
μg of α1B-1 or α1B-2, α2b and β1-2 or β1-3 expression plasmids, 
respectively. 
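
The fold changes quoted above follow directly from the plasmid masses used in the two transfection schemes (8, 5 and 2 μg versus 3, 11.25 and 0.75 μg). The following sketch simply reproduces that arithmetic; the dictionary keys and variable names are illustrative labels, not identifiers from the source.

```python
# Plasmid masses (ug) in the initial and the adjusted transfections.
initial = {"alpha1B-1": 8.0, "alpha2b": 5.0, "beta1-2": 2.0}
adjusted = {"alpha1B-1": 3.0, "alpha2b": 11.25, "beta1-2": 0.75}

# Fold decrease in the alpha1B-1 and beta1-2 plasmid masses.
alpha1_decrease = initial["alpha1B-1"] / adjusted["alpha1B-1"]
beta_decrease = initial["beta1-2"] / adjusted["beta1-2"]

# Fold increase in the alpha2b plasmid mass.
alpha2_increase = adjusted["alpha2b"] / initial["alpha2b"]

# Change in alpha2b relative to the other two plasmids. Because the same
# plasmids are used in both schemes, mass ratios and molar ratios change
# by the same factor.
relative_increase = alpha2_increase * alpha1_decrease

print(round(alpha1_decrease, 2), round(beta_decrease, 2),
      round(alpha2_increase, 2), round(relative_increase, 2))
```

The same factor connects the two stated molar ratios: multiplying the α2b term of 2:1.8:1 by about 6 gives approximately 2:10.9:1.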

2. 125I-ω-CgTx GVIA binding to transfected cells 

Statistical analysis of the Kd and Bmax values was 
performed using one-way analysis of variance (ANOVA) followed 
by the Tukey-Kramer test for multiple pairwise comparisons 
(p≤0.05). 

Combinations of human voltage-dependent Ca2+ channel 
subunits, α1B-1, α1B-2, α2b, β1-2 and β1-3, were analyzed for 
saturation binding of 125I-ω-CgTx GVIA. About 200,000 cells 
were used per assay, except for the α1B-1, α1B-1α2b and 
α1B-2α2b combinations, which were assayed with 1 x 10^6 cells per tube. 
The transfected cells displayed a single class of saturable 
high-affinity binding sites. The values for the dissociation 
constants (Kd) and binding capacities (Bmax) were determined for 
the different combinations. The results are summarized as 
follows: 

Subunit Combination     Kd (pM)               Bmax (sites/cell) 

α1B-1α2bβ1-2            54.9 ± 11.1 (n=4)     45,324 ± 15,606 
α1B-1α2bβ1-3            53.2 ± 3.6 (n=3)      91,004 ± 37,654 
α1B-1β1-2               17.9 ± 1.9 (n=3)      5,756 ± 2,163 
α1B-1β1-3               17.9 ± 1.6 (n=3)      8,729 ± 2,980 
α1B-1α2b                84.6 ± 15.3 (n=3)     2,256 ± 356 
α1B-1                   31.7 ± 4.2 (n=3)      757 ± 128 
α1B-2α2bβ1-2            53.0 ± 4.8 (n=3)      19,371 ± 3,798 
α1B-2α2bβ1-3            44.3 ± 8.1 (n=3)      37,652 ± 8,129 
α1B-2β1-2               16.4 ± 1.2 (n=3)      2,126 ± 412 
α1B-2β1-3               22.2 ± 5.8 (n=3)      2,944 ± 1,168 
α1B-2α2b                N.D.* (n=3)           N.D. 
α1B-2                   N.D.                  N.D. 

* N.D. = not detectable 
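
For orientation, the relation between Kd, Bmax and a Scatchard plot for a single class of saturable sites can be sketched as below. This is an illustration under stated assumptions rather than the analysis actually performed: the function names and the free-ligand concentrations are hypothetical, the data are generated noise-free from one pair of tabulated mean values (54.9 pM and 45,324 sites/cell), and an ordinary least-squares line stands in for whatever fitting method was used.

```python
def bound_sites(free_pM, kd_pM, bmax):
    """One-site saturation binding: B = Bmax * F / (Kd + F)."""
    return bmax * free_pM / (kd_pM + free_pM)


def scatchard_fit(free_pM, bound):
    """Fit a line to (B, B/F) and return (Kd, Bmax).

    For one class of sites, B/F = Bmax/Kd - B/Kd, so the slope is -1/Kd
    and the intercept on the B axis is Bmax.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free_pM)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -1.0 / slope, -intercept / slope


# Hypothetical free-ligand concentrations (pM) and ideal binding data.
free = [5.0, 10.0, 25.0, 50.0, 100.0, 200.0]
ideal = [bound_sites(f, 54.9, 45324.0) for f in free]
kd_est, bmax_est = scatchard_fit(free, ideal)
print(round(kd_est, 1), round(bmax_est))
```

With real, noisy binding data the Scatchard transformation distorts the error structure, which is one reason very low binding levels do not support a reliable analysis of this kind.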

Cells transfected with subunit combinations lacking 
either the or the a I8 . 2 subunit did not exhibit any 

detectable 125I-ω-CgTx GVIA binding (≤ 600 sites/cell). 125I-ω-CgTx
GVIA binding to HEK293 cells transfected with α1B-2 alone
or α1B-2α2b was too low for reliable Scatchard analysis of the
data. Comparison of the Kd and Bmax values revealed several
relationships between specific combinations of subunits and
the binding affinities and capacities of the transfected
cells. In cells transfected with all three subunits (α1B-1α2bβ1-2,
α1B-1α2bβ1-3, α1B-2α2bβ1-2, or α1B-2α2bβ1-3 transfectants), the
Kd values were indistinguishable (p>0.05), ranging from 44.3
± 8.1 pM to 54.9 ± 11.1 pM. In cells transfected with
two-subunit combinations lacking the α2b subunit (α1B-1β1-2, α1B-1β1-3,
α1B-2β1-2 or α1B-2β1-3), the Kd values were significantly lower than
those of the three-subunit combinations (p<0.01), ranging from 16.4 ±
1.2 to 22.2 ± 5.8 pM. Cells transfected with only the α1B-1
subunit had a Kd value of 31.7 ± 4.2 pM, a value that was not
different from the two-subunit combinations lacking α2b
(p<0.05). As with the comparison between the four α1Bα2bβ1
versus α1Bβ1 combinations, when α1B-1 was co-expressed with
α2b, the Kd increased significantly (p<0.05) from 31.7 ± 4.2 to
84.6 ± 5.3 pM. These data demonstrate that co-expression of
the α2b subunit with the α1B-1, α1B-1β1-2, α1B-1β1-3, α1B-2β1-2 or α1B-2β1-3
subunit combinations results in lower binding affinity of the
cell surface receptors for 125I-ω-CgTx GVIA. The Bmax values
of cells transfected with various subunit combinations also
differed considerably. Cells transfected with the α1B-1 subunit
alone expressed a low but detectable number of binding sites
(approximately 750 binding sites/cell). When the α1B-1 subunit
was co-expressed with the α2b subunit, the binding capacity
increased approximately three-fold, while co-expression of a β1-2
or β1-3 subunit with α1B-1 resulted in 8- to 10-fold higher
expression of surface binding. Cells transfected with all
three subunits expressed the highest number of cell surface
receptors. The binding capacities of cells transfected with
α1B-1α2bβ1-3 or α1B-2α2bβ1-3 combinations were approximately two-fold
higher than the corresponding combinations containing the β1-2
subunit. Likewise, cells transfected with α1B-1α2bβ1-2 or
α1B-1α2bβ1-3 combinations expressed approximately 2.5-fold more
binding sites per cell than the corresponding combinations
containing α1B-2. In all cases, co-expression of the α2b
subunit with α1B and β1 increased the surface receptor density
compared to cells transfected with only the corresponding α1B and
β1 combinations: approximately 8-fold for α1B-1α2bβ1-2, 10-fold for
α1B-1α2bβ1-3, 3-fold for α1B-2α2bβ1-2, and 13-fold for α1B-2α2bβ1-3. Thus,
comparison of the Bmax values suggests that the toxin-binding
subunit, α1B-1 or α1B-2, is more efficiently expressed and
assembled on the cell surface when co-expressed with either the
α2b or the β1-2 or β1-3 subunit, and most efficiently expressed
when α2b and β1 subunits are present.
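
The affinity comparison above can be illustrated with the single-site binding isotherm that underlies Scatchard analysis, B = Bmax * L / (Kd + L). The sketch below is only an illustration: the two Kd values are taken from the text, while the free toxin concentration, the normalised Bmax and the function names are assumptions made for the example.

```python
def bound_sites(bmax, kd_pm, free_pm):
    """Specific binding for one class of sites: B = Bmax * L / (Kd + L)."""
    return bmax * free_pm / (kd_pm + free_pm)


def scatchard_point(bmax, kd_pm, free_pm):
    """Return (bound, bound/free); a Scatchard plot shows bound/free vs bound."""
    bound = bound_sites(bmax, kd_pm, free_pm)
    return bound, bound / free_pm


# Kd values quoted in the text (pM).
KD_TWO_SUBUNIT = 16.4    # lowest Kd reported for combinations lacking alpha2b
KD_THREE_SUBUNIT = 54.9  # highest Kd reported for three-subunit combinations

# Assumptions for illustration only: Bmax normalised to 1, 50 pM free toxin.
FREE_LIGAND_PM = 50.0

occ_two = bound_sites(1.0, KD_TWO_SUBUNIT, FREE_LIGAND_PM)
occ_three = bound_sites(1.0, KD_THREE_SUBUNIT, FREE_LIGAND_PM)
print(f"fractional occupancy at Kd 16.4 pM: {occ_two:.2f}")
print(f"fractional occupancy at Kd 54.9 pM: {occ_three:.2f}")
```

At any fixed toxin concentration the lower Kd gives the higher fractional occupancy, which is the sense in which a higher Kd corresponds to lower binding affinity.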
3. Electrophysiology

Functional expression of α1B-1α2bβ1-2 and α1B-1β1-2 subunit
combinations was evaluated using the whole-cell recording
technique. Transfected cells that had no contacts with
surrounding cells and simple morphology were used approximately
48 hours after transfection for recording. The pipette
solution was (in mM) 135 CsCl, 10 EGTA, 1 MgCl2, 10 HEPES, and
4 Mg-ATP (pH 7.3, adjusted with TEA-OH). The external
solution was (in mM) 15 BaCl2, 125 choline Cl, 1 MgCl2, and 10
HEPES (pH 7.3, adjusted with TEA-OH). ω-CgTx GVIA (Bachem) was
prepared in the external solution with 0.1% cytochrome C
(Sigma) to serve as a carrier. Control experiments showed that
cytochrome C had no effect on the Ba2+ current.

The macroscopic electrophysiological properties of Ba2+
currents in cells transfected with various amounts of the α2b
expression plasmid, with the relative amounts of the α1B-1 and β1-2
plasmids held constant, were examined. The amplitudes and
densities of the Ba2+ currents (15 mM BaCl2) recorded from whole
cells of these transfectants differed dramatically. The
average currents from 7 to 11 cells of three types of
transfections (no α2b; 2:1.8:1 [α1B-1:α2b:β1-2] molar ratio; and
2:10.9:1 [α1B-1:α2b:β1-2] molar ratio) were determined. The
smallest currents (range: 10 to 205 pA) were recorded when α2b
was not included in the transfection, and the largest currents
(range: 50 to 8300 pA) were recorded with the 2:10.9:1 ratio of
α1B-1α2bβ1-2 plasmids, the ratio that resulted in near-equivalent
mRNA levels for each subunit transcript. When the amount of α2b
plasmid was adjusted to yield approximately an equal abundance
of subunit mRNAs, the average peak Ba2+ current increased from
433 pA to 1,824 pA (4.2-fold), with a corresponding increase in
average current density from 26 pA/pF to 127 pA/pF (4.9-fold).
This increase occurred despite a 2.7-fold decrease in the
mass of α1B-1 and β1-2 expression plasmids in the transfections.
In all transfections, the magnitudes of the Ba2+ currents did
not follow a normal distribution.
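
The fold increases quoted above follow directly from the reported averages. The short check below reproduces that arithmetic; the function name is ours, and only the four averaged values come from the text.

```python
def fold_change(before, after):
    """Ratio of the adjusted-ratio average to the starting average."""
    return after / before


# Averages reported in the text.
peak_fold = fold_change(433.0, 1824.0)    # peak Ba2+ current, pA
density_fold = fold_change(26.0, 127.0)   # current density, pA/pF

print(f"peak current: {peak_fold:.1f}-fold")
print(f"current density: {density_fold:.1f}-fold")
```

Current density is the whole-cell current divided by the membrane capacitance of the same cell, so the two ratios need not be identical when the averaged cells differ in size.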

To compare the subunit combinations and determine the
effects of α2b, the current-voltage properties of cells
transfected with α1B-1β1-2 or with α1B-1α2bβ1-2 in either the 2:1.8:1
(α1B-1:α2b:β1-2) molar ratio or the 2:10.9:1 (α1B-1:α2b:β1-2) molar
ratio were examined. The extreme examples of no
α2b and 11.25 µg α2b (2:10.9:1 molar ratio) showed no significant
differences in the current-voltage plot at test potentials
between 0 mV and +40 mV (p<0.05). The slight differences
observed at either side of the peak region of the
current-voltage plot were likely due to normalization. The very small
currents observed in the α1B-1β1-2 transfected cells have a
substantially higher component of residual leak relative to the
barium current that is activated by the test pulse. When the
current-voltage plots are normalized, this leak is a much
greater component than in the α1B-1α2bβ1-2 transfected cells and,
as a result, the current-voltage plot appears broader. This is
the most likely explanation of the apparent differences in the
current-voltage plots, especially given the fact that the
current-voltage plot for the α1B-1β1-2 transfected cells diverges
on both sides of the peak. Typically, when the voltage
dependence of activation is shifted, the entire current-voltage
plot is shifted, which was not observed. To qualitatively
compare the kinetics of each, the average responses of test
pulses from -90 mV to 10 mV were normalized and plotted. No
significant differences in activation or inactivation kinetics
of whole-cell Ba2+ currents were observed with any combination.
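
The normalization argument above can be made concrete with a toy calculation. The sketch below is an illustration only: the current-voltage shape, the 5 pA residual leak and the two peak amplitudes are hypothetical values chosen for the example, not data from the recordings.

```python
# Hypothetical I-V shape: fraction of the peak channel current at each
# test potential (mV). These numbers are assumptions, not measured data.
IV_SHAPE = {-20: 0.2, 0: 0.8, 10: 1.0, 20: 0.9, 40: 0.5}
RESIDUAL_LEAK_PA = 5.0  # assumed uncorrected leak, equal at every potential


def normalized_iv(peak_pa, leak_pa=RESIDUAL_LEAK_PA, shape=IV_SHAPE):
    """Normalize (channel current + residual leak) to the largest value."""
    total = {v: peak_pa * frac + leak_pa for v, frac in shape.items()}
    largest = max(total.values())
    return {v: i / largest for v, i in total.items()}


small = normalized_iv(peak_pa=50.0)    # transfectant with small currents
large = normalized_iv(peak_pa=2000.0)  # transfectant with large currents

for v in sorted(IV_SHAPE):
    print(f"{v:+4d} mV  small: {small[v]:.3f}  large: {large[v]:.3f}")
```

With the same underlying shape, the small-current curve sits higher on both flanks of the peak after normalization, so it looks broader without any shift in voltage dependence.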

G. Expression of DNA encoding human neuronal calcium
channel α1E-3α2Bβ1-3 and α1E-1α2Bβ1-3 subunits in HEK cells

Functional expression of the α1E-1α2Bβ1-3 and α1E-3α2Bβ1-3, as
well as α1E-3, was evaluated using the whole-cell recording
technique.

1. Methods

Recordings were performed on transiently transfected HEK
293 cells two days following the transfection, from cells that
had no contacts with surrounding cells and which had simple
morphology.

The internal solution used to fill pipettes for recording
the barium current from the transfected recombinant calcium
channels was (in mM) 135 CsCl, 10 EGTA, 1 MgCl2, 10 HEPES, and
4 Mg-ATP (pH 7.4-7.5, adjusted with TEA-OH). The external
solution for recording the barium current was (in mM) 15 BaCl2,
150 choline Cl, 1 MgCl2, 10 HEPES, and 5 TEA-OH (pH 7.3,
adjusted with TEA-OH). In experiments in which Ca2+ was
substituted for Ba2+, a laminar flow chamber was used in order to
completely exchange the extracellular solution and prevent any
mixing of Ba2+ and Ca2+. ω-CgTx GVIA was prepared in the
external solution with 0.1% cytochrome C to serve as a carrier;
the toxin was applied by pressurized puffer pipette. Series
resistance was compensated 70-85%, and currents were analyzed
only if the voltage error from series resistance was less than
5 mV. Leak resistance and capacitance were corrected by
subtracting the scaled current observed with the P/-4 protocol
as implemented by pClamp (Axon Instruments).
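
The two acceptance and correction steps described above can be sketched as follows. This is a generic illustration under stated assumptions, not the pClamp implementation: the function names, the 10 MΩ series resistance, the 80% compensation and the example traces are all hypothetical.

```python
def series_resistance_error_mv(current_pa, rs_mohm, compensation):
    """Residual voltage error in mV: I * Rs * (1 - fraction compensated)."""
    # pA * MOhm = 1e-12 A * 1e6 Ohm = 1e-6 V = 1e-3 mV
    return current_pa * rs_mohm * (1.0 - compensation) * 1e-3


def p_over_n_subtract(test_trace, subpulse_traces, n=-4):
    """Remove linear leak and capacitance using |n| subpulses of size P/n.

    Each subpulse response is (linear response to P) / n, so their sum is
    sign(n) times the linear response, which is subtracted from the test.
    """
    if len(subpulse_traces) != abs(n):
        raise ValueError("expected |n| subpulse traces")
    sign = 1.0 if n > 0 else -1.0
    summed = [sum(samples) for samples in zip(*subpulse_traces)]
    return [t - sign * s for t, s in zip(test_trace, summed)]


# Hypothetical cell: 2000 pA current, 10 MOhm series resistance, 80% compensated.
error_mv = series_resistance_error_mv(2000.0, 10.0, 0.80)
print(f"voltage error: {error_mv:.1f} mV, accepted: {error_mv < 5.0}")

# Hypothetical traces: 0.5 pA/mV linear leak, 100 mV test step, -25 mV subpulses.
channel = [0.0, -300.0, -150.0]
test = [i + 0.5 * 100.0 for i in channel]
subpulses = [[0.5 * -25.0] * len(channel) for _ in range(4)]
corrected = p_over_n_subtract(test, subpulses, n=-4)
print(corrected)
```

In this example the residual error is below the 5 mV criterion, and the subtraction returns the channel current because the assumed leak is purely linear.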

2. Electrophysiology Results

Cells transfected with α1E-1α2bβ1-3 or α1E-3α2bβ1-3 showed strong
barium currents with whole-cell patch clamp recordings. Cells
expressing α1E-3α2bβ1-3 had larger peak currents than those
expressing α1E-1α2bβ1-3. In addition, the kinetics of activation
and inactivation are substantially faster in the cells
expressing α1E calcium channels. HEK 293 cells expressing α1E-3
alone have a significant degree of functional calcium channels,
with properties similar to those expressing α1Eα2bβ1 but with
substantially smaller peak barium currents. Thus, with α1E, the
α2 and β1 subunits are not required for functional expression of
α1E-mediated calcium channels, but do substantially increase the
number of functional calcium channels.

Examination of the current-voltage properties of α1Eα2bβ1-3
expressing cells indicates that α1E-3α2bβ1-3 is a high-voltage
activated calcium channel and the peak current is reached at a
potential only slightly less positive than other neuronal
calcium channels also expressing α2b and β1, and α1B and α1D.
Current-voltage properties of α1E-1α2bβ1-3 and α1E-3α2bβ1-3 are
statistically different from those of α1B-1α2bβ1-3. Current-voltage
curves for α1E-1α2bβ1-3 and α1E-3α2bβ1-3 peak at approximately
+5 mV, as does the current-voltage curve for α1E-3 alone.

The kinetics and voltage dependence of inactivation using
both prepulse (200 ms) and steady-state inactivation were
examined. α1E-mediated calcium channels are rapidly inactivated
relative to previously cloned calcium channels and other
high-voltage-activated calcium channels. α1E-3α2bβ1-3-mediated calcium
channels are inactivated rapidly and are thus sensitive to
relatively brief (200 ms) prepulses as well as long prepulses
(>20 s steady-state inactivation), but recover rapidly from
steady-state inactivation. The kinetics of the rapid
inactivation have two components, one with a time constant of
approximately 25 ms and the other approximately 400 ms.
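
A two-component inactivation of this kind is commonly described as a sum of two exponentials. The sketch below uses the two time constants given above; the equal split between the fast and slow components is an assumption, since the text does not state their relative amplitudes.

```python
import math


def biexp_inactivation(t_ms, frac_fast=0.5, tau_fast_ms=25.0, tau_slow_ms=400.0):
    """Fraction of peak current remaining after t_ms of depolarization.

    frac_fast is an assumed amplitude split; the time constants are the
    approximate values reported in the text.
    """
    fast = frac_fast * math.exp(-t_ms / tau_fast_ms)
    slow = (1.0 - frac_fast) * math.exp(-t_ms / tau_slow_ms)
    return fast + slow


for t in (0.0, 25.0, 200.0, 400.0):
    print(f"t = {t:5.0f} ms  remaining fraction = {biexp_inactivation(t):.3f}")
```

Under these assumptions the fast component has essentially decayed by the end of a 200 ms prepulse, which is consistent with the channels being sensitive to brief prepulses.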

To determine whether α1E-mediated calcium channels have
properties of low-voltage-activated calcium channels, the
details of tail currents activated by a test pulse ranging from -60
to +90 mV were measured at -60 mV. Tail currents recorded at
-60 mV could be well fit by a single exponential of 150 to 300
µs, at least an order of magnitude faster than those typically
observed with low-voltage-activated calcium channels.
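
Fitting a tail current with a single exponential can be sketched as a straight-line fit to the logarithm of the current. The example below is a minimal illustration on synthetic data: the 200 µs time constant (chosen from within the reported 150 to 300 µs range), the 500 pA amplitude and the sampling interval are assumptions, and the fitting routine is a plain least-squares line rather than the method used for the recordings.

```python
import math


def fit_single_exponential(times, currents):
    """Least-squares line through (t, ln|I|); returns (amplitude, tau)."""
    logs = [math.log(abs(i)) for i in currents]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_t
    return math.exp(intercept), -1.0 / slope


# Synthetic, noise-free tail current (all values hypothetical).
TAU_US = 200.0
times_us = [10.0 * k for k in range(1, 60)]
tail_pa = [-500.0 * math.exp(-t / TAU_US) for t in times_us]

amplitude_pa, tau_us = fit_single_exponential(times_us, tail_pa)
print(f"fitted amplitude: {amplitude_pa:.1f} pA, tau: {tau_us:.1f} us")
```

The log-linear approach only works for a decay toward zero; real traces with noise or a steady offset would need a nonlinear fit.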

HEK 293 cells expressing α1E-3α2bβ1-3 flux more current with
Ba2+ as the charge carrier, and currents carried by Ba2+ and Ca2+
have different current-voltage properties. Furthermore, the
time course of inactivation is slower and the amount of
prepulse inactivation less with Ca2+ as the charge carrier.

While the invention has been described with some 
specificity, modifications apparent to those with ordinary 
skill in the art may be made without departing from the scope 
of the invention. Since such modifications will be apparent to
those of skill in the art, it is intended that this invention
be limited only by the scope of the appended claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: THE SALK INSTITUTE BIOTECHNOLOGY/INDUSTRIAL ASSOCIATES

(B) STREET: 505 COAST BLVD SOUTH, SUITE 300 

(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY: USA

(F) POSTAL CODE (ZIP): 92037

(ii) TITLE OF INVENTION: HUMAN CALCIUM CHANNEL COMPOSITIONS AND 
METHODS 

(iii) NUMBER OF SEQUENCES: 38

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/149,097

(B) FILING DATE: 5-NOV-1993

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/105,536 

(B) FILING DATE: 11-AUG-1993

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 511..6996

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..510 

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 6994..7635
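
The feature coordinates above can be checked against one another with simple arithmetic. The sketch below uses only the locations and the 7635 base pair length listed for SEQ ID NO:1; the dictionary and function names are ours, and treating the last codon of the CDS as a stop codon is an assumption based on the feature overlap.

```python
# Feature locations as listed for SEQ ID NO:1 (1-based, inclusive).
FEATURES = {"5'UTR": (1, 510), "CDS": (511, 6996), "3'UTR": (6994, 7635)}
SEQUENCE_LENGTH = 7635


def feature_length(start, end):
    """Length in nucleotides of an inclusive 1-based interval."""
    return end - start + 1


cds_length = feature_length(*FEATURES["CDS"])
codons = cds_length // 3
# Assumption: the final codon of the CDS is the stop codon.
residues = codons - 1
overlap = FEATURES["CDS"][1] - FEATURES["3'UTR"][0] + 1

print(f"CDS length: {cds_length} nt ({codons} codons, {residues} residues)")
print(f"CDS / 3'UTR overlap: {overlap} nt")
```

The three-nucleotide overlap between the CDS and the 3'UTR as listed is what one would expect if the stop codon were counted in both features.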

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GGGCGAGCGC CTCCGTCCCC GGATGTGAGC TCCGGCTGCC CGCGGTCCCG AGCCAGCGGC 60 

GCGCGGGCGG CGGCGGCGGG CACCGGGCAC CGCGGCGGGC GGGCAGACGG GCGGGCATGG 120

GGGGAGCGCC GAGCGGCCCC GGCGGCCGGG CCGGCATCAC CGCGGCGTCT CTCCGCTAGA 180 

GGAGGGGACA AGCCAGTTCT CCTTTGCAGC AAAAAATTAC ATGTATATAT TATTAAGATA 240 

ATATATACAT TGGATTTTAT TTTTTTAAAA AGTTTATTTT GCTCCATTTT TGAAAAAGAG 300 

AGAGCTTGGG TGGCGAGCGG TTTTTTTTTA AAATCAATTA TCCTTATTTT CTGTTATTTG 360

TCCCCGTCCC TCCCCACCCC CCTGCTGAAG CGAGAATAAG GGCAGGGACC GCGGCTCCTA 420

CCTCTTGGTG ATCCCCTTCC CCATTCCGCC CCCGCCCCAA CGCCCAGCAC AGTGCCCTGC 480 

ACACAGTAGT CGCTCAATAA ATGTTCGTGG ATG ATG ATG ATG ATG ATG ATG AAA 534 

Met Met Met Met Met Met Met Lys 
1 5 

AAA ATG CAG CAT CAA CGG CAG CAG CAA GCG GAC CAC GCG AAC GAG GCA 582 
Lys Met Gln His Gln Arg Gln Gln Gln Ala Asp His Ala Asn Glu Ala
10 15 20 

AAC TAT GCA AGA GGC ACC AGA CTT CCT CTT TCT GGT GAA GGA CCA ACT 630 
Asn Tyr Ala Arg Gly Thr Arg Leu Pro Leu Ser Gly Glu Gly Pro Thr 
25 30 35 40 

TCT CAG CCG AAT AGC TCC AAG CAA ACT GTC CTG TCT TGG CAA GCT GCA 678 
Ser Gln Pro Asn Ser Ser Lys Gln Thr Val Leu Ser Trp Gln Ala Ala
45 50 55

ATC GAT GCT GCT AGA CAG GCC AAG GCT GCC CAA ACT ATG AGC ACC TCT 726 
Ile Asp Ala Ala Arg Gln Ala Lys Ala Ala Gln Thr Met Ser Thr Ser
60 65 70 

GCA CCC CCA CCT GTA GGA TCT CTC TCC CAA AGA AAA CGT CAG CAA TAC 774 
Ala Pro Pro Pro Val Gly Ser Leu Ser Gln Arg Lys Arg Gln Gln Tyr
75 80 85 

GCC AAG AGC AAA AAA CAG GGT AAC TCG TCC AAC AGC CGA CCT GCC CGC 822 
Ala Lys Ser Lys Lys Gln Gly Asn Ser Ser Asn Ser Arg Pro Ala Arg
90 95 100 

GCC CTT TTC TGT TTA TCA CTC AAT AAC CCC ATC CGA AGA GCC TGC ATT 870 
Ala Leu Phe Cys Leu Ser Leu Asn Asn Pro Ile Arg Arg Ala Cys Ile
105 110 115 120

AGT ATA GTG GAA TGG AAA CCA TTT GAC ATA TTT ATA TTA TTG GCT ATT 918 
Ser Ile Val Glu Trp Lys Pro Phe Asp Ile Phe Ile Leu Leu Ala Ile
125 130 135

TTT GCC AAT TGT GTG GCC TTA GCT ATT TAC ATC CCA TTC CCT GAA GAT 966
Phe Ala Asn Cys Val Ala Leu Ala Ile Tyr Ile Pro Phe Pro Glu Asp
140 145 150 

GAT TCT AAT TCA ACA AAT CAT AAC TTG GAA AAA GTA GAA TAT GCC TTC 1014 
Asp Ser Asn Ser Thr Asn His Asn Leu Glu Lys Val Glu Tyr Ala Phe 
155 160 165 

CTG ATT ATT TTT ACA GTC GAG ACA TTT TTG AAG ATT ATA GCG TAT GGA 1062 
Leu Ile Ile Phe Thr Val Glu Thr Phe Leu Lys Ile Ile Ala Tyr Gly
170 175 180 

TTA TTG CTA CAT CCT AAT GCT TAT GTT AGG AAT GGA TGG AAT TTA CTG 1110 
Leu Leu Leu His Pro Asn Ala Tyr Val Arg Asn Gly Trp Asn Leu Leu 
185 190 195 200 

GAT TTT GTT ATA GTA ATA GTA GGA TTG TTT AGT GTA ATT TTG GAA CAA 1158 
Asp Phe Val Ile Val Ile Val Gly Leu Phe Ser Val Ile Leu Glu Gln
205 210 215 

TTA ACC AAA GAA ACA GAA GGC GGG AAC CAC TCA AGC GGC AAA TCT GGA 1206 
Leu Thr Lys Glu Thr Glu Gly Gly Asn His Ser Ser Gly Lys Ser Gly 
220 225 230 

GGC TTT GAT GTC AAA GCC CTC CGT GCC TTT CGA GTG TTG CGA CCA CTT 1254 
Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu 
235 240 245 

CGA CTA GTG TCA GGA GTG CCC AGT TTA CAA GTT GTC CTG AAC TCC ATT 1302
Arg Leu Val Ser Gly Val Pro Ser Leu Gln Val Val Leu Asn Ser Ile
250 255 260 

ATA AAA GCC ATG GTT CCC CTC CTT CAC ATA GCC CTT TTG GTA TTA TTT 1350 
Ile Lys Ala Met Val Pro Leu Leu His Ile Ala Leu Leu Val Leu Phe
265 270 275 280 

GTA ATC ATA ATC TAT GCT ATT ATA GGA TTG GAA CTT TTT ATT GGA AAA 1398
Val Ile Ile Ile Tyr Ala Ile Ile Gly Leu Glu Leu Phe Ile Gly Lys
285 290 295 

ATG CAC AAA ACA TGT TTT TTT GCT GAC TCA GAT ATC GTA GCT GAA GAG 1446 
Met His Lys Thr Cys Phe Phe Ala Asp Ser Asp Ile Val Ala Glu Glu
300 305 310 

GAC CCA GCT CCA TGT GCG TTC TCA GGG AAT GGA CGC CAG TGT ACT GCC 1494 
Asp Pro Ala Pro Cys Ala Phe Ser Gly Asn Gly Arg Gln Cys Thr Ala
315 320 325 

AAT GGC ACG GAA TGT AGG AGT GGC TGG GTT GGC CCG AAC GGA GGC ATC 1542 
Asn Gly Thr Glu Cys Arg Ser Gly Trp Val Gly Pro Asn Gly Gly Ile
330 335 340 

ACC AAC TTT GAT AAC TTT GCC TTT GCC ATG CTT ACT GTG TTT CAG TGC 1590 
Thr Asn Phe Asp Asn Phe Ala Phe Ala Met Leu Thr Val Phe Gln Cys
345 350 355 360

WO 95/04822 PCT/US94/09230

ATC ACC ATG GAG GGC TGG ACA GAC GTG CTC TAC TGG ATG AAT GAT GCT 1638
Ile Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp Met Asn Asp Ala
365 370 375

ATG GGA TTT GAA TTG CCC TGG GTG TAT TTT GTC AGT CTC GTC ATC TTT 1686
Met Gly Phe Glu Leu Pro Trp Val Tyr Phe Val Ser Leu Val Ile Phe
380 385 390

GGG TCA TTT TTC GTA CTA AAT CTT GTA CTT GGT GTA TTG AGC GGA GAA 1734
Gly Ser Phe Phe Val Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu
395 400 405

TTC TCA AAG GAA AGA GAG AAG GCA AAA GCA CGG GGA GAT TTC CAG AAG 1782
Phe Ser Lys Glu Arg Glu Lys Ala Lys Ala Arg Gly Asp Phe Gln Lys
410 415 420

CTC CGG GAG AAG CAG CAG CTG GAG GAG GAT CTA AAG GGC TAC TTG GAT 1830
Leu Arg Glu Lys Gln Gln Leu Glu Glu Asp Leu Lys Gly Tyr Leu Asp
425 430 435 440

™2 t? C ££ C °? T G ^ G GAC ATC GAT CCG GAG AAT GAG GAA GAA GGA 1878
Trp Ile Thr Gln Ala Glu Asp Ile Asp Pro Glu Asn Glu Glu Glu Gly
445 450 455

GGA GAG GAA GGC AAA CGA AAT ACT AGC ATG CCC ACC AGC GAG ACT GAG 1926 
Gly Glu Glu Gly Lys Arg Asn Thr Ser Met Pro Thr Ser Glu Thr Glu 
460 465 470 

TCT GTG AAC ACA GAG AAC GTC AGC GGT GAA GGC GAG AAC CGA GGC TGC 1974 
Ser Val Asn Thr Glu Asn Val Ser Gly Glu Gly Glu Asn Arg Gly Cys
475 480 485

TGT GGA AGT CTC TGT CAA GCC ATC TCA AAA TCC AAA CTC AGC CGA CGC 2022 
Cys Gly Ser Leu Cys Gln Ala Ile Ser Lys Ser Lys Leu Ser Arg Arg
490 495 500

TGG CGT CGC TGG AAC CGA TTC AAT CGC AGA AGA TGT AGG GCC GCC GTG 2070

Trp Arg Arg Trp Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val 
505 510 515 520 

AAG TCT GTC ACG TTT TAC TGG CTG GTT ATC GTC CTG GTG TTT CTG AAC 2118
Lys Ser Val Thr Phe Tyr Trp Leu Val Ile Val Leu Val Phe Leu Asn
525 530 535 

ACC TTA ACC ATT TCC TCT GAG CAC TAC AAT CAG CCA GAT TGG TTG ACA 2166 
Thr Leu Thr Ile Ser Ser Glu His Tyr Asn Gln Pro Asp Trp Leu Thr
540 545 550 

CAG ATT CAA GAT ATT GCC AAC AAA GTC CTC TTG GCT CTG TTC ACC TGC 2214 
Gln Ile Gln Asp Ile Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys
555 560 565 

GAG ATG CTG GTA AAA ATG TAC AGC TTG GGC CTC CAA GCA TAT TTC GTC 2262 
Glu Met Leu Val Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr Phe Val
570 575 580

TCT CTT TTC AAC CGG TTT GAT TGC TTC GTG GTG TGT GGT GGA ATC ACT 2310
Ser Leu Phe Asn Arg Phe Asp Cys Phe Val Val Cys Gly Gly Ile Thr
585 590 595 600 

GAG ACG ATC TTG GTG GAA CTG GAA ATC ATG TCT CCC CTG GGG ATC TCT 2358 
Glu Thr Ile Leu Val Glu Leu Glu Ile Met Ser Pro Leu Gly Ile Ser
605 610 615 

GTG TTT CGG TGT GTG CGC CTC TTA AGA ATC TTC AAA GTG ACC AGG CAC 2406 
Val Phe Arg Cys Val Arg Leu Leu Arg Ile Phe Lys Val Thr Arg His
620 625 630 

TGG ACT TCC CTG AGC AAC TTA GTG GCA TCC TTA TTA AAC TCC ATG AAG 2454 
Trp Thr Ser Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys 
635 640 645 

TCC ATC GCT TCG CTG TTG CTT CTG CTT TTT CTC TTC ATT ATC ATC TTT 2502 
Ser Ile Ala Ser Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile Ile Phe
650 655 660 

TCC TTG CTT GGG ATG CAG CTG TTT GGC GGC AAG TTT AAT TTT GAT GAA 2550
Ser Leu Leu Gly Met Gln Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu
665 670 675 680 

ACG CAA ACC AAG CGG AGC ACC TTT GAC AAT TTC CCT CAA GCA CTT CTC 2598 
Thr Gln Thr Lys Arg Ser Thr Phe Asp Asn Phe Pro Gln Ala Leu Leu
685 690 695 

ACA GTG TTC CAG ATC CTG ACA GGC GAA GAC TGG AAT GCT GTG ATG TAC 2646 
Thr Val Phe Gln Ile Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr
700 705 710 

GAT GGC ATC ATG GCT TAC GGG GGC CCA TCC TCT TCA GGA ATG ATC GTC 2694 
Asp Gly Ile Met Ala Tyr Gly Gly Pro Ser Ser Ser Gly Met Ile Val
715 720 725

TGC ATC TAC TTC ATC ATC CTC TTC ATT TGT GGT AAC TAT ATT CTA CTG 2742 
Cys Ile Tyr Phe Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile Leu Leu
730 735 740 

AAT GTC TTC TTG GCC ATC GCT GTA GAC AAT TTG GCT GAT GCT GAA AGT 2790 
Asn Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala Asp Ala Glu Ser
745 750 755 760 

CTG AAC ACT GCT CAG AAA GAA GAA GCG GAA GAA AAG GAG AGG AAA AAG 2838 
Leu Asn Thr Ala Gln Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys
765 770 775 

ATT GCC AGA AAA GAG AGC CTA GAA AAT AAA AAG AAC AAC AAA CCA GAA 2886 
Ile Ala Arg Lys Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu
780 785 790 

GTC AAC CAG ATA GCC AAC AGT GAC AAC AAG GTT ACA ATT GAT GAC TAT 2934 
Val Asn Gln Ile Ala Asn Ser Asp Asn Lys Val Thr Ile Asp Asp Tyr
795 800 805

AGA GAA GAG GAT GAA GAC AAG GAC CCC TAT CCG CCT TGC GAT GTG CCA 2982
Arg Glu Glu Asp Glu Asp Lys Asp Pro Tyr Pro Pro Cys Asp Val Pro 
810 815 820 

GTA GGG GAA GAG GAA GAG GAA GAG GAG GAG GAT GAA CCT GAG GTT CCT 3030

Val Gly Glu Glu Glu Glu Glu Glu Glu Glu Asp Glu Pro Glu Val Pro 
825 830 835 840 

GCC GGA CCC CGT CCT CGA AGG ATC TCG GAG TTG AAC ATG AAG GAA AAA 3078 
Ala Gly Pro Arg Pro Arg Arg Ile Ser Glu Leu Asn Met Lys Glu Lys
845 850 855 

ATT GCC CCC ATC CCT GAA GGG AGC GCT TTC TTC ATT CTT AGC AAG ACC 3126 
Ile Ala Pro Ile Pro Glu Gly Ser Ala Phe Phe Ile Leu Ser Lys Thr
860 865 870 

AAC CCG ATC CGC GTA GGC TGC CAC AAG CTC ATC AAC CAC CAC ATC TTC 3174 
Asn Pro Ile Arg Val Gly Cys His Lys Leu Ile Asn His His Ile Phe
875 880 885 

ACC AAC CTC ATC CTT GTC TTC ATC ATG CTG AGC AGT GCT GCC CTG GCC 3222 
Thr Asn Leu Ile Leu Val Phe Ile Met Leu Ser Ser Ala Ala Leu Ala
890 895 900 

GCA GAG GAC CCC ATC CGC AGC CAC TCC TTC CGG AAC ACG ATA CTG GGT 3270 
Ala Glu Asp Pro Ile Arg Ser His Ser Phe Arg Asn Thr Ile Leu Gly
905 910 915 920

TAC TTT GAC TAT GCC TTC ACA GCC ATC TTT ACT GTT GAG ATC CTG TTG 3318 
Tyr Phe Asp Tyr Ala Phe Thr Ala Ile Phe Thr Val Glu Ile Leu Leu
925 930 935 

AAG ATG ACA ACT TTT GGA GCT TTC CTC CAC AAA GGG GCC TTC TGC AGG 3366

Lys Met Thr Thr Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg 
940 945 950 

AAC TAC TTC AAT TTG CTG GAT ATG CTG GTG GTT GGG GTG TCT CTG GTG 3414
Asn Tyr Phe Asn Leu Leu Asp Met Leu Val Val Gly Val Ser Leu Val 
955 960 965 

TCA TTT GGG ATT CAA TCC AGT GCC ATC TCC GTT GTG AAG ATT CTG AGG 3462 
Ser Phe Gly Ile Gln Ser Ser Ala Ile Ser Val Val Lys Ile Leu Arg
970 975 980 

GTC TTA AGG GTC CTG CGT CCC CTC AGG GCC ATC AAC AGA GCA AAA GGA 3510
Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly
985 990 995 1000 

CTT AAG CAC GTG GTC CAG TGC GTC TTC GTG GCC ATC CGG ACC ATC GGC 3558 
Leu Lys His Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly
1005 1010 1015

AAC ATC ATG ATC GTC ACC ACC CTC CTG CAG TTC ATG TTT GCC TGT ATC 3606 
Asn Ile Met Ile Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile
1020 1025 1030

GGG GTC CAG TTG TTC AAG GGG AAG TTC TAT CGC TGT ACG GAT GAA GCC 3654
Gly Val Gln Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala
1035 1040 1045 

AAA AGT AAC CCT GAA GAA TGC AGG GGA CTT TTC ATC CTC TAC AAG GAT 3702 
Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe Ile Leu Tyr Lys Asp
1050 1055 1060 

GGG GAT GTT GAC AGT CCT GTG GTC CGT GAA CGG ATC TGG CAA AAC AGT 3750 
Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg Ile Trp Gln Asn Ser
1065 1070 1075 1080 

GAT TTC AAC TTC GAC AAC GTC CTC TCT GCT ATG ATG GCG CTC TTC ACA 3798

Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala Leu Phe Thr 
1085 1090 1095 

GTC TCC ACG TTT GAG GGC TGG CCT GCG TTG CTG TAT AAA GCC ATC GAC 3846 
Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr Lys Ala Ile Asp
1100 1105 1110 

TCG AAT GGA GAG AAC ATC GGC CCA ATC TAC AAC CAC CGC GTG GAG ATC 3894 
Ser Asn Gly Glu Asn Ile Gly Pro Ile Tyr Asn His Arg Val Glu Ile
1115 1120 1125 

TCC ATC TTC TTC ATC ATC TAC ATC ATC ATT GTA GCT TTC TTC ATG ATG 3942 
Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile Val Ala Phe Phe Met Met
1130 1135 1140 

AAC ATC TTT GTG GGC TTT GTC ATC GTT ACA TTT CAG GAA CAA GGA GAA 3990 
Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe Gln Glu Gln Gly Glu
1145 1150 1155 1160 

AAA GAG TAT AAG AAC TGT GAG CTG GAC AAA AAT CAG CGT CAG TGT GTT 4038 
Lys Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gln Arg Gln Cys Val
1165 1170 1175 

GAA TAC GCC TTG AAA GCA CGT CCC TTG CGG AGA TAC ATC CCC AAA AAC 4086 
Glu Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr Ile Pro Lys Asn
1180 1185 1190 

CCC TAC CAG TAC AAG TTC TGG TAC GTG GTG AAC TCT TCG CCT TTC GAA 4134 
Pro Tyr Gln Tyr Lys Phe Trp Tyr Val Val Asn Ser Ser Pro Phe Glu
1195 1200 1205

TAC ATG ATG TTT GTC CTC ATC ATG CTC AAC ACA CTC TGC TTG GCC ATG 4182 
Tyr Met Met Phe Val Leu Ile Met Leu Asn Thr Leu Cys Leu Ala Met
1210 1215 1220 

CAG CAC TAC GAG CAG TCC AAG ATG TTC AAT GAT GCC ATG GAC ATT CTG 4230 
Gln His Tyr Glu Gln Ser Lys Met Phe Asn Asp Ala Met Asp Ile Leu
1225 1230 1235 1240 

AAC ATG GTC TTC ACC GGG GTG TTC ACC GTC GAG ATG GTT TTG AAA GTC 4278 
Asn Met Val Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val 
1245 1250 1255

ATC GCA TTT AAG CCT AAG GGG TAT TTT AGT GAC GCC TGG AAC ACG TTT 4326
Ile Ala Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp Asn Thr Phe
1260 1265 1270

GAC TCC CTC ATC GTA ATC GGC AGC ATT ATA GAC GTG GCC CTC AGC GAA 4374
Asp Ser Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ala Leu Ser Glu
1275 1280 1285

GCA GAC CCA ACT GAA AGT GAA AAT GTC CCT GTC CCA ACT GCT ACA CCT 4422
Ala Asp Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr Ala Thr Pro
1290 1295 1300 

GGG AAC TCT GAA GAG AGC AAT AGA ATC TCC ATC ACC TTT TTC CGT CTT 4470 
Gly Asn Ser Glu Glu Ser Asn Arg Ile Ser Ile Thr Phe Phe Arg Leu
1305 1310 1315 1320 

TTC CGA GTG ATG CGA TTG GTG AAG CTT CTC AGC AGG GGG GAA GGC ATC 4518 
Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly Glu Gly Ile
1325 1330 1335 

CGG ACA TTG CTG TGG ACT TTT ATT AAG TTC TTT CAG GCG CTC CCG TAT 4566 
Arg Thr Leu Leu Trp Thr Phe Ile Lys Phe Phe Gln Ala Leu Pro Tyr
1340 1345 1350 

GTG GCC CTC CTC ATA GCC ATG CTG TTC TTC ATC TAT GCG GTC ATT GGC 4614 
Val Ala Leu Leu Ile Ala Met Leu Phe Phe Ile Tyr Ala Val Ile Gly
1355 1360 1365 

ATG CAG ATG TTT GGG AAA GTT GCC ATG AGA GAT AAC AAC CAG ATC AAT 4662 
Met Gln Met Phe Gly Lys Val Ala Met Arg Asp Asn Asn Gln Ile Asn
1370 1375 1380

AGG AAC AAT AAC TTC CAG ACG TTT CCC CAG GCG GTG CTG CTG CTC TTC 4710 
Arg Asn Asn Asn Phe Gln Thr Phe Pro Gln Ala Val Leu Leu Leu Phe
1385 1390 1395 1400 

AGG TGT GCA ACA GGT GAG GCC TGG CAG GAG ATC ATG CTG GCC TGT CTC 4758 
Arg Cys Ala Thr Gly Glu Ala Trp Gln Glu Ile Met Leu Ala Cys Leu
1405 1410 1415 

CCA GGG AAG CTC TGT GAC CCT GAG TCA GAT TAC AAC CCC GGG GAG GAG 4806

Pro Gly Lys Leu Cys Asp Pro Glu Ser Asp Tyr Asn Pro Gly Glu Glu 
1420 1425 1430 

CAT ACA TGT GGG AGC AAC TTT GCC ATT GTC TAT TTC ATC AGT TTT TAC 4854
His Thr Cys Gly Ser Asn Phe Ala Ile Val Tyr Phe Ile Ser Phe Tyr
1435 1440 1445

ATG CTC TGT GCA TTT CTG ATC ATC AAT CTG TTT GTG GCT GTC ATC ATG 4902 
Met Leu Cys Ala Phe Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met
1450 1455 1460 

GAT AAT TTC GAC TAT CTG ACC CGG GAC TGG TCT ATT TTG GGG CCT CAC 4950 
Asp Asn Phe Asp Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His
1465 1470 1475 1480

CAT TTA GAT GAA TTC AAA AGA ATA TGG TCA GAA TAT GAC CCT GAG GCA 4998
His Leu Asp Glu Phe Lys Arg Ile Trp Ser Glu Tyr Asp Pro Glu Ala
1485 1490 1495 

AAG GGA AGG ATA AAA CAC CTT GAT GTG GTC ACT CTG CTT CGA CGC ATC 5046 
Lys Gly Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile
1500 1505 1510 

CAG CCT CCC CTG GGG TTT GGG AAG TTA TGT CCA CAC AGG GTA GCG TGC 5094 
Gln Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys
1515 1520 1525 

AAG AGA TTA GTT GCC ATG AAC ATG CCT CTC AAC AGT GAC GGG ACA GTC 5142 
Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val 
1530 1535 1540 

ATG TTT AAT GCA ACC CTG TTT GCT TTG GTT CGA ACG GCT CTT AAG ATC 5190 
Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys Ile
1545 1550 1555 1560 

AAG ACC GAA GGG AAC CTG GAG CAA GCT AAT GAA GAA CTT CGG GCT GTG 5238
Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu Arg Ala Val
1565 1570 1575 

ATA AAG AAA ATT TGG AAG AAA ACC AGC ATG AAA TTA CTT GAC CAA GTT 5286
Ile Lys Lys Ile Trp Lys Lys Thr Ser Met Lys Leu Leu Asp Gln Val
1580 1585 1590 

GTC CCT CCA GCT GGT GAT GAT GAG GTA ACC GTG GGG AAG TTC TAT GCC 5334
Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr Ala
1595 1600 1605

ACT TTC CTG ATA CAG GAC TAC TTT AGG AAA TTC AAG AAA CGG AAA GAA 5382
Thr Phe Leu Ile Gln Asp Tyr Phe Arg Lys Phe Lys Lys Arg Lys Glu
1610 1615 1620 

CAA GGA CTG GTG GGA AAG TAC CCT GCG AAG AAC ACC ACA ATT GCC CTA 5430
Gln Gly Leu Val Gly Lys Tyr Pro Ala Lys Asn Thr Thr Ile Ala Leu
1625 1630 1635 1640 

CAG GCG GGA TTA AGG ACA CTG CAT GAC ATT GGG CCA GAA ATC CGG CGT 5478
Gln Ala Gly Leu Arg Thr Leu His Asp Ile Gly Pro Glu Ile Arg Arg
1645 1650 1655 

GCT ATA TCG TGT GAT TTG CAA GAT GAC GAG CCT GAG GAA ACA AAA CGA 5526 
Ala Ile Ser Cys Asp Leu Gln Asp Asp Glu Pro Glu Glu Thr Lys Arg
1660 1665 1670 

GAA GAA GAA GAT GAT GTG TTC AAA AGA AAT GGT GCC CTG CTT GGA AAC 5574 
Glu Glu Glu Asp Asp Val Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn 
1675 1680 1685 

CAT GTC AAT CAT GTT AAT AGT GAT AGG AGA GAT TCC CTT CAG CAG ACC 5622 
His Val Asn His Val Asn Ser Asp Arg Arg Asp Ser Leu Gln Gln Thr
1690 1695 1700

^ ££ C £5° C ^ C CGT CCC CTG ^ GTC ^ AGG CCT TCA ATT CCA CCT 5670
Asn Thr Thr His Arg Pro Leu His Val Gln Arg Pro Ser Ile Pro Pro
1705 1710 1715 1720

GCA AGT GAT ACT GAG AAA CCG CTG TTT CCT CCA GCA GGA AAT TCG GTG 5718
Ala Ser Asp Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val
1725 1730 1735

n?,I v AT ^ C 5^ T ^ T AAC CAT AAT TCC ATA GGA AAG CAA GTT CCC ACC 5766
Cys His Asn His His Asn His Asn Ser Ile Gly Lys Gln Val Pro Thr
1740 1745 1750

S« JS iS ti C ^ T T CTC GCC ATG TCC AAA GCT GCC CAT 5814
Ser Thr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His
1755 1760 1765

nit ^ ^ GG CCC AGC ATT GGG AAC CTT GAG CAT GTG TCT GAA AAT GGG 5862
Gly ****** Pro Ser Ile Gly Asn Leu Glu His Val Ser Glu Asn Gly
1770 1775 1780

CAT CAT TCT TCC CAC AAG CAT GAC CGG GAG CCT CAG AGA AGG TCC AGT 5910
His His Ser Ser His Lys His Asp Arg Glu Pro Gln Arg Arg Ser Ser
1785 1790 1795 1800

GTG AAA AGA ACC CGC TAT TAT GAA ACT TAC ATT AGG TCC GAC TCA GGA 5958
Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr Ile Arg Ser Asp Ser Gly
1805 1810 1815

aol ^ S^ G CTC CCA ACT ATT TGC CGG GAA GAC CCA GAG ATA CAT GGC 6006
Asp Glu Gln Leu Pro Thr Ile Cys Arg Glu Asp Pro Glu Ile His Gly
1820 1825 1830

TAT TTC AGG GAC CCC CAC TGC TTG GGG GAG CAG GAG TAT TTC AGT AGT 6054
Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gln Glu Tyr Phe Ser Ser
1835 1840 1845

GAG GAA TGC TAC GAG GAT GAC AGC TCG CCC ACC TGG AGC AGG CAA AAC 6102
Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro Thr Trp Ser Arg Gln Asn
1850 1855 1860

TAT GGC TAC TAC AGC AGA TAC CCA GGC AGA AAC ATC GAC TCT GAG AGG 6150
Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly Arg Asn Ile Asp Ser Glu Arg
1865 1870 1875 1880

CCC CGA GGC TAC CAT CAT CCC CAA GGA TTC TTG GAG GAC GAT GAC TCG 6198
Pro Arg Gly Tyr His His Pro Gln Gly Phe Leu Glu Asp Asp Asp Ser
1885 1890 1895

CCC GTT TGC TAT GAT TCA CGG AGA TCT CCA AGG AGA CGC CTA CTA CCT 6246
Pro Val Cys Tyr Asp Ser Arg Arg Ser Pro Arg Arg Arg Leu Leu Pro
1900 1905 1910

CCC ACC CCA GCA TCC CAC CGG AGA TCC TCC TTC AAC TTT GAG TGC CTG 6294
Pro Thr Pro Ala Ser His Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu
1915 1920 1925

CGC CGG CAG AGC AGC CAG GAA GAG GTC CCG TCG TCT CCC ATC TTC CCC 6342
Arg Arg Gln Ser Ser Gln Glu Glu Val Pro Ser Ser Pro Ile Phe Pro
1930 1935 1940 

CAT CGC ACG GCC CTG CCT CTG CAT CTA ATG CAG CAA CAG ATC ATG GCA 6390
His Arg Thr Ala Leu Pro Leu His Leu Met Gln Gln Gln Ile Met Ala
1945 1950 1955 1960 

GTT GCC GGC CTA GAT TCA AGT AAA GCC CAG AAG TAC TCA CCG AGT CAC 6438 
Val Ala Gly Leu Asp Ser Ser Lys Ala Gln Lys Tyr Ser Pro Ser His
1965 1970 1975 

TCG ACC CGG TCG TGG GCC ACC CCT CCA GCA ACC CCT CCC TAC CGG GAC 6486 
Ser Thr Arg Ser Trp Ala Thr Pro Pro Ala Thr Pro Pro Tyr Arg Asp 
1980 1985 1990 

TGG ACA CCG TGC TAC ACC CCC CTG ATC CAA GTG GAG CAG TCA GAG GCC 6534 
Trp Thr Pro Cys Tyr Thr Pro Leu Ile Gln Val Glu Gln Ser Glu Ala
1995 2000 2005 

CTG GAC CAG GTG AAC GGC AGC CTG CCG TCC CTG CAC CGC AGC TCC TGG 6582 
Leu Asp Gln Val Asn Gly Ser Leu Pro Ser Leu His Arg Ser Ser Trp
2010 2015 2020 

TAC ACA GAC GAG CCC GAC ATC TCC TAC CGG ACT TTC ACA CCA GCC AGC 6630 
Tyr Thr Asp Glu Pro Asp Ile Ser Tyr Arg Thr Phe Thr Pro Ala Ser
2025 2030 2035 2040

CTG ACT GTC CCC AGC AGC TTC CGG AAC AAA AAC AGC GAC AAG CAG AGG 6678
Leu Thr Val Pro Ser Ser Phe Arg Asn Lys Asn Ser Asp Lys Gln Arg
2045 2050 2055 

AGT GCG GAC AGC TTG GTG GAG GCA GTC CTG ATA TCC GAA GGC TTG GGA 672 6 

Ser Ala Asp Ser Leu Val Glu Ala Val Leu He Ser Glu Gly Leu Gly 
2060 2065 2070 

CGC TAT GCA AGG GAC CCA AAA TTT GTG TCA GCA ACA AAA CAC GAA ATC 6 774 

Arg Tyr Ala Arg Asp Pro Lys Phe Val Ser Ala Thr Lys His Glu He 
2075 2080 2085 

GCT GAT GCC TGT GAC CTC ACC ATC GAC GAG ATG GAG AGT GCA GCC AGC 6 822 

Ala Asp Ala Cys Asp Leu Thr He Asp Glu Met Glu Ser Ala Ala Ser 
2090 2095 2100 

ACC CTG CTT AAT GGG AAC GTG CGT CCC CGA GCC AAC GGG GAT GTG GGC 6 87 0 

Thr Leu Leu Asn Gly Asn Val Arg Pro Arg Ala Asn Gly Asp Val Gly 
2105 2110 2115 2120 

CCC CTC TCA CAC CGG CAG GAC TAT GAG CTA CAG GAC TTT GGT CCT GGC 6 918 

Pro Leu Ser His Arg Gin Asp Tyr Glu Leu Gin Asp Phe Gly Pro Gly 
2125 2130 2135 

TAC AGC GAC GAA GAG CCA GAC CCT GGG AGG GAT GAG GAG GAC CTG GCG 6966 
Tyr Ser Asp Glu Glu Pro Asp Pro Gly Arg Asp Glu Glu Asp Leu Ala 
2140 2145 2150 

GAT GAA ATG ATA TGC ATC ACC ACC TTG TAGCCCCCAG CGAGGGGCAG 7013 
Asp Glu Met Ile Cys Ile Thr Thr Leu 
2155 2160 

ACTGGCTCTG GCCTCAGGTG GGGCGCAGGA GAGCCAGGGG AAAAGTGCCT CATAGTTAGG 7073 

AAAGTTTAGG CACTAGTTGG GAGTAATATT CAATTAATTA GACTTTTGTA TAAGAGATGT 7133 

CATGCCTCAA GAAAGCCATA AACCTGGTAG GAACAGGTCC CAAGCGGTTG AGCCTGGCAG 7193 

AGTACCATGC GCTCGGCCCC AGCTGCAGGA AACAGCAGGC CCCGCCCTCT CACAGAGGAT 7253 

GGGTGAGGAG GCCAGACCTG CCCTGCCCCA TTGTCCAGAT GGGCACTGCT GTGGAGTCTG 7313 

CTTCTCCCAT GTACCAGGGC ACCAGGCCCA CCCAACTGAA GGCATGGCGG CGGGGTGCAG 7373 

GGGAAAGTTA AAGGTGATGA CGATCATCAC ACCTGTGTCG TTACCTCAGC CATCGGTCTA 7433 

GCATATCAGT CACTGGGCCC AACATATCCA TTTTTAAACC CTTTCCCCCA AATACACTGC 7493 

GTCCTGGTTC CTGTTTAGCT GTTCTGAAAT ACGGTGTGTA AGTAAGTCAG AACCCAGCTA 7553 

CCAGTGATTA TTGCGAGGGC AATGGGACCT CATAAATAAG GTTTTCTGTG ATGTGACGCC 7613 

AGTTTACATA AGAGAATATC AC 7635 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..102 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..104 

(D) OTHER INFORMATION: /note= "A 104-nucleotide 
alternative exon of alpha-1D." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GTA AAT GAT GCG ATA GGA TGG GAA TGG CCA TGG GTG TAT TTT GTT AGT 48 
Val Asn Asp Ala Ile Gly Trp Glu Trp Pro Trp Val Tyr Phe Val Ser 
1 5 10 15 

CTG ATC ATC CTT GGC TCA TTT TTC GTC CTT AAC CTG GTT CTT GGT GTC 96 
Leu Ile Ile Leu Gly Ser Phe Phe Val Leu Asn Leu Val Leu Gly Val 
20 25 30 

CTT AGT GG 104 
Leu Ser 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..6492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GTC AAT GAG AAT ACG AGG ATG TAC ATT CCA GAG GAA AAC CAC CAA 48 
Met Val Asn Glu Asn Thr Arg Met Tyr Ile Pro Glu Glu Asn His Gln 
1 5 10 15 

GGT TCC AAC TAT GGG AGC CCA CGC CCC GCC CAT GCC AAC ATG AAT GCC 96 
Gly Ser Asn Tyr Gly Ser Pro Arg Pro Ala His Ala Asn Met Asn Ala 
20 25 30 

AAT GCG GCA GCG GGG CTG GCC CCT GAG CAC ATC CCC ACC CCG GGG GCT 144 
Asn Ala Ala Ala Gly Leu Ala Pro Glu His Ile Pro Thr Pro Gly Ala 
35 40 45 

GCC CTG TCG TGG CAG GCG GCC ATC GAC GCA GCC CGG CAG GCT AAG CTG 192 
Ala Leu Ser Trp Gln Ala Ala Ile Asp Ala Ala Arg Gln Ala Lys Leu 
50 55 60 

ATG GGC AGC GCT GGC AAT GCG ACC ATC TCC ACA GTC AGC TCC ACG CAG 240 
Met Gly Ser Ala Gly Asn Ala Thr Ile Ser Thr Val Ser Ser Thr Gln 
65 70 75 80 

CGG AAG CGC CAG CAA TAT GGG AAA CCC AAG AAG CAG GGC AGC ACC ACG 288 
Arg Lys Arg Gln Gln Tyr Gly Lys Pro Lys Lys Gln Gly Ser Thr Thr 
85 90 95 

GCC ACA CGC CCG CCC CGA GCC CTG CTC TGC CTG ACC CTG AAG AAC CCC 336 
Ala Thr Arg Pro Pro Arg Ala Leu Leu Cys Leu Thr Leu Lys Asn Pro 
100 105 110 

ATC CGG AGG GCC TGC ATC AGC ATT GTC GAA TGG AAA CCA TTT GAA ATA 384 
Ile Arg Arg Ala Cys Ile Ser Ile Val Glu Trp Lys Pro Phe Glu Ile 
115 120 125 

ATT ATT TTA CTG ACT ATT TTT GCC AAT TGT GTG GCC TTA GCG ATC TAT 432 
Ile Ile Leu Leu Thr Ile Phe Ala Asn Cys Val Ala Leu Ala Ile Tyr 
130 135 140 

ATT CCC TTT CCA GAA GAT GAT TCC AAC GCC ACC AAT TCC AAC CTG GAA 480 
Ile Pro Phe Pro Glu Asp Asp Ser Asn Ala Thr Asn Ser Asn Leu Glu 
145 150 155 160 

CGA GTG GAA TAT CTC TTT CTC ATA ATT TTT ACG GTG GAA GCG TTT TTA 528 
Arg Val Glu Tyr Leu Phe Leu Ile Ile Phe Thr Val Glu Ala Phe Leu 
165 170 175 

AAA GTA ATC GCC TAT GGA CTC CTC TTT CAC CCC AAT GCC TAC CTC CGC 576 
Lys Val Ile Ala Tyr Gly Leu Leu Phe His Pro Asn Ala Tyr Leu Arg 
180 185 190 

AAC GGC TGG AAC CTA CTA GAT TTT ATA ATT GTG GTT GTG GGG CTT TTT 624 
Asn Gly Trp Asn Leu Leu Asp Phe Ile Ile Val Val Val Gly Leu Phe 
195 200 205 

AGT GCA ATT TTA GAA CAA GCA ACC AAA GCA GAT GGG GCA AAC GCT CTC 672 
Ser Ala Ile Leu Glu Gln Ala Thr Lys Ala Asp Gly Ala Asn Ala Leu 
210 215 220 

GGA GGG AAA GGG GCC GGA TTT GAT GTG AAG GCG CTG AGG GCC TTC CGC 720 
Gly Gly Lys Gly Ala Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg 
225 230 235 240 

GTG CTG CGC CCC CTG CGG CTG GTG TCC GGA GTC CCA AGT CTC CAG GTG 768 
Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser Leu Gln Val 
245 250 255 

GTC CTG AAT TCC ATC ATC AAG GCC ATG GTC CCC CTG CTG CAC ATC GCC 816 
Val Leu Asn Ser Ile Ile Lys Ala Met Val Pro Leu Leu His Ile Ala 
260 265 270 

CTG CTT GTG CTG TTT GTC ATC ATC ATC TAC GCC ATC ATC GGC TTG GAG 864 
Leu Leu Val Leu Phe Val Ile Ile Ile Tyr Ala Ile Ile Gly Leu Glu 
275 280 285 

CTC TTC ATG GGG AAG ATG CAC AAG ACC TGC TAC AAC CAG GAG GGC ATA 912 
Leu Phe Met Gly Lys Met His Lys Thr Cys Tyr Asn Gln Glu Gly Ile 
290 295 300 

GCA GAT GTT CCA GCA GAA GAT GAC CCT TCC CCT TGT GCG CTG GAA ACG 960 
Ala Asp Val Pro Ala Glu Asp Asp Pro Ser Pro Cys Ala Leu Glu Thr 
305 310 315 320 

GGC CAC GGG CGG CAG TGC CAG AAC GGC ACG GTG TGC AAG CCC GGC TGG 1008 
Gly His Gly Arg Gln Cys Gln Asn Gly Thr Val Cys Lys Pro Gly Trp 
325 330 335 

GAT GGT CCC AAG CAC GGC ATC ACC AAC TTT GAC AAC TTT GCC TTC GCC 1056 
Asp Gly Pro Lys His Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe Ala 
340 345 350 

ATG CTC ACG GTG TTC CAG TGC ATC ACC ATG GAG GGC TGG ACG GAC GTG 1104 
Met Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp Thr Asp Val 
355 360 365 

CTG TAC TGG GTC AAT GAT GCC GTA GGA AGG GAC TGG CCC TGG ATC TAT 1152 
Leu Tyr Trp Val Asn Asp Ala Val Gly Arg Asp Trp Pro Trp Ile Tyr 
370 375 380 

TTT GTT ACA CTA ATC ATC ATA GGG TCA TTT TTT GTA CTT AAC TTG GTT 1200 
Phe Val Thr Leu Ile Ile Ile Gly Ser Phe Phe Val Leu Asn Leu Val 
385 390 395 400 

CTC GGT GTG CTT AGC GGA GAG TTT TCC AAA GAG AGG GAG AAG GCC AAG 1248 
Leu Gly Val Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala Lys 
405 410 415 

GCC CGG GGA GAT TTC CAG AAG CTG CGG GAG AAG CAG CAG CTA GAA GAG 1296 
Ala Arg Gly Asp Phe Gln Lys Leu Arg Glu Lys Gln Gln Leu Glu Glu 
420 425 430 

GAT CTC AAA GGC TAC CTG GAT TGG ATC ACT CAG GCC GAA GAC ATC GAT 1344 
Asp Leu Lys Gly Tyr Leu Asp Trp Ile Thr Gln Ala Glu Asp Ile Asp 
435 440 445 

CCT GAG AAT GAG GAC GAA GGC ATG GAT GAG GAG AAG CCC CGA AAC AGA 1392 
Pro Glu Asn Glu Asp Glu Gly Met Asp Glu Glu Lys Pro Arg Asn Arg 
450 455 460 

GGC ACT CCG GCG GGC ATG CTT GAT CAG AAG AAA GGG AAG TTT GCT TGG 1440 
Gly Thr Pro Ala Gly Met Leu Asp Gln Lys Lys Gly Lys Phe Ala Trp 
465 470 475 480 

TTT AGT CAC TCC ACA GAA ACC CAT GTG AGC ATG CCC ACC AGT GAG ACC 1488 
Phe Ser His Ser Thr Glu Thr His Val Ser Met Pro Thr Ser Glu Thr 
485 490 495 

GAG TCC GTC AAC ACC GAA AAC GTG GCT GGA GGT GAC ATC GAG GGA GAA 1536 
Glu Ser Val Asn Thr Glu Asn Val Ala Gly Gly Asp Ile Glu Gly Glu 
500 505 510 

AAC TGC GGG GCC AGG CTG GCC CAC CGG ATC TCC AAG TCA AAG TTC AGC 1584 
Asn Cys Gly Ala Arg Leu Ala His Arg Ile Ser Lys Ser Lys Phe Ser 
515 520 525 

CGC TAC TGG CGC CGG TGG AAT CGG TTC TGC AGA AGG AAG TGC CGC GCC 1632 
Arg Tyr Trp Arg Arg Trp Asn Arg Phe Cys Arg Arg Lys Cys Arg Ala 
530 535 540 

GCA GTC AAG TCT AAT GTC TTC TAC TGG CTG GTG ATT TTC CTG GTG TTC 1680 
Ala Val Lys Ser Asn Val Phe Tyr Trp Leu Val Ile Phe Leu Val Phe 
545 550 555 560 

CTC AAC ACG CTC ACC ATT GCC TCT GAG CAC TAC AAC CAG CCC AAC TGG 1728 
Leu Asn Thr Leu Thr Ile Ala Ser Glu His Tyr Asn Gln Pro Asn Trp 
565 570 575 

CTC ACA GAA GTC CAA GAC ACG GCA AAC AAG GCC CTG CTG GCC CTG TTC 1776 
Leu Thr Glu Val Gln Asp Thr Ala Asn Lys Ala Leu Leu Ala Leu Phe 
580 585 590 

ACG GCA GAG ATG CTC CTG AAG ATG TAC AGC CTG GGC CTG CAG GCC TAC 1824 
Thr Ala Glu Met Leu Leu Lys Met Tyr Ser Leu Gly Leu Gln Ala Tyr 
595 600 605 

TTC GTG TCC CTC TTC AAC CGC TTT GAC TGC TTC GTC GTG TGT GGC GGC 1872 
Phe Val Ser Leu Phe Asn Arg Phe Asp Cys Phe Val Val Cys Gly Gly 
610 615 620 

ATC CTG GAG ACC ATC CTG GTG GAG ACC AAG ATC ATG TCC CCA CTG GGC 1920 
Ile Leu Glu Thr Ile Leu Val Glu Thr Lys Ile Met Ser Pro Leu Gly 
625 630 635 640 

ATC TCC GTG CTC AGA TGC GTC CGG CTG CTG AGG ATT TTC AAG ATC ACG 1968 
Ile Ser Val Leu Arg Cys Val Arg Leu Leu Arg Ile Phe Lys Ile Thr 
645 650 655 

AGG TAC TGG AAC TCC TTG AGC AAC CTG GTG GCA TCC TTG CTG AAC TCT 2016 
Arg Tyr Trp Asn Ser Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser 
660 665 670 

GTG CGC TCC ATC GCC TCC CTG CTC CTT CTC CTC TTC CTC TTC ATC ATC 2064 
Val Arg Ser Ile Ala Ser Leu Leu Leu Leu Leu Phe Leu Phe Ile Ile 
675 680 685 

ATC TTC TCC CTC CTG GGG ATG CAG CTC TTT GGA GGA AAG TTC AAC TTT 2112 
Ile Phe Ser Leu Leu Gly Met Gln Leu Phe Gly Gly Lys Phe Asn Phe 
690 695 700 

GAT GAG ATG CAG ACC CGG AGG AGC ACA TTC GAT AAC TTC CCC CAG TCC 2160 
Asp Glu Met Gln Thr Arg Arg Ser Thr Phe Asp Asn Phe Pro Gln Ser 
705 710 715 720 

CTC CTC ACT GTG TTT CAG ATC CTG ACC GGG GAG GAC TGG AAT TCG GTG 2208 
Leu Leu Thr Val Phe Gln Ile Leu Thr Gly Glu Asp Trp Asn Ser Val 
725 730 735 

ATG TAT GAT GGG ATC ATG GCT TAT GGG GGC CCC TCT TTT CCA GGG ATG 2256 
Met Tyr Asp Gly Ile Met Ala Tyr Gly Gly Pro Ser Phe Pro Gly Met 
740 745 750 

TTA GTC TGT ATT TAC TTC ATC ATC CTC TTC ATC TGT GGA AAC TAT ATC 2304 
Leu Val Cys Ile Tyr Phe Ile Ile Leu Phe Ile Cys Gly Asn Tyr Ile 
755 760 765 

CTA CTG AAT GTG TTC TTG GCC ATT GCT GTG GAC AAC CTG GCT GAT GCT 2352 
Leu Leu Asn Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala Asp Ala 
770 775 780 

GAG AGC CTC ACA TCT GCC CAA AAG GAG GAG GAA GAG GAG AAG GAG AGA 2400 
Glu Ser Leu Thr Ser Ala Gln Lys Glu Glu Glu Glu Glu Lys Glu Arg 
785 790 795 800 

AAG AAG CTG GCC AGG ACT GCC AGC CCA GAG AAG AAA CAA GAG TTG GTG 2448 
Lys Lys Leu Ala Arg Thr Ala Ser Pro Glu Lys Lys Gln Glu Leu Val 
805 810 815 

GAG AAG CCG GCA GTG GGG GAA TCC AAG GAG GAG AAG ATT GAG CTG AAA 2496 
Glu Lys Pro Ala Val Gly Glu Ser Lys Glu Glu Lys Ile Glu Leu Lys 
820 825 830 

TCC ATC ACG GCT GAC GGA GAG TCT CCA CCC GCC ACC AAG ATC AAC ATG 2544 
Ser Ile Thr Ala Asp Gly Glu Ser Pro Pro Ala Thr Lys Ile Asn Met 
835 840 845 

GAT GAC CTC CAG CCC AAT GAA AAT GAG GAT AAG AGC CCC TAC CCC AAC 2592 
Asp Asp Leu Gln Pro Asn Glu Asn Glu Asp Lys Ser Pro Tyr Pro Asn 
850 855 860 

CCA GAA ACT ACA GGA GAA GAG GAT GAG GAG GAG CCA GAG ATG CCT GTC 2640 
Pro Glu Thr Thr Gly Glu Glu Asp Glu Glu Glu Pro Glu Met Pro Val 
865 870 875 880 

GGC CCT CGC CCA CGA CCA CTC TCT GAG CTT CAC CTT AAG GAA AAG GCA 2688 
Gly Pro Arg Pro Arg Pro Leu Ser Glu Leu His Leu Lys Glu Lys Ala 
885 890 895 

GTG CCC ATG CCA GAA GCC AGC GCG TTT TTC ATC TTC AGC TCT AAC AAC 2736 
Val Pro Met Pro Glu Ala Ser Ala Phe Phe Ile Phe Ser Ser Asn Asn 
900 905 910 

AGG TTT CGC CTC CAG TGC CAC CGC ATT GTC AAT GAC ACG ATC TTC ACC 2784 
Arg Phe Arg Leu Gln Cys His Arg Ile Val Asn Asp Thr Ile Phe Thr 
915 920 925 

AAC CTG ATC CTC TTC TTC ATT CTG CTC AGC AGC ATT TCC CTG GCT GCT 2832 
Asn Leu Ile Leu Phe Phe Ile Leu Leu Ser Ser Ile Ser Leu Ala Ala 
930 935 940 

GAG GAC CCG GTC CAG CAC ACC TCC TTC AGG AAC CAT ATT CTG TTT TAT 2880 
Glu Asp Pro Val Gln His Thr Ser Phe Arg Asn His Ile Leu Phe Tyr 
945 950 955 960 

TTT GAT ATT GTT TTT ACC ACC ATT TTC ACC ATT GAA ATT GCT CTG AAG 2928 
Phe Asp Ile Val Phe Thr Thr Ile Phe Thr Ile Glu Ile Ala Leu Lys 
965 970 975 

ATG ACT GCT TAT GGG GCT TTC TTG CAC AAG GGT TCT TTC TGC CGG AAC 2976 
Met Thr Ala Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys Arg Asn 
980 985 990 

TAC TTC AAC ATC CTG GAC CTG CTG GTG GTC AGC GTG TCC CTC ATC TCC 3024 
Tyr Phe Asn Ile Leu Asp Leu Leu Val Val Ser Val Ser Leu Ile Ser 
995 1000 1005 

TTT GGC ATC CAG TCC AGT GCA ATC AAT GTC GTG AAG ATC TTG CGA GTC 3072 
Phe Gly Ile Gln Ser Ser Ala Ile Asn Val Val Lys Ile Leu Arg Val 
1010 1015 1020 

CTG CGA GTA CTC AGG CCC CTG AGG GCC ATC AAC AGG GCC AAG GGG CTA 3120 
Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg Ala Lys Gly Leu 
1025 1030 1035 1040 

AAG CAT GTG GTT CAG TGT GTG TTT GTC GCC ATC CGG ACC ATC GGG AAC 3168 
Lys His Val Val Gln Cys Val Phe Val Ala Ile Arg Thr Ile Gly Asn 
1045 1050 1055 

ATC GTG ATT GTC ACC ACC CTG CTG CAG TTC ATG TTT GCC TGC ATC GGG 3216 
Ile Val Ile Val Thr Thr Leu Leu Gln Phe Met Phe Ala Cys Ile Gly 
1060 1065 1070 

GTC CAG CTC TTC AAG GGA AAG CTG TAC ACC TGT TCA GAC AGT TCC AAG 3264 
Val Gln Leu Phe Lys Gly Lys Leu Tyr Thr Cys Ser Asp Ser Ser Lys 
1075 1080 1085 

CAG ACA GAG GCG GAA TGC AAG GGC AAC TAC ATC ACG TAC AAA GAC GGG 3312 
Gln Thr Glu Ala Glu Cys Lys Gly Asn Tyr Ile Thr Tyr Lys Asp Gly 
1090 1095 1100 

GAG GTT GAC CAC CCC ATC ATC CAA CCC CGC AGC TGG GAG AAC AGC AAG 3360 
Glu Val Asp His Pro Ile Ile Gln Pro Arg Ser Trp Glu Asn Ser Lys 
1105 1110 1115 1120 

TTT GAC TTT GAC AAT GTT CTG GCA GCC ATG ATG GCC CTC TTC ACC GTC 3408 
Phe Asp Phe Asp Asn Val Leu Ala Ala Met Met Ala Leu Phe Thr Val 
1125 1130 1135 

TCC ACC TTC GAA GGG TGG CCA GAG CTG CTG TAC CGC TCC ATC GAC TCC 3456 
Ser Thr Phe Glu Gly Trp Pro Glu Leu Leu Tyr Arg Ser Ile Asp Ser 
1140 1145 1150 

CAC ACG GAA GAC AAG GGC CCC ATC TAC AAC TAC CGT GTG GAG ATC TCC 3504 
His Thr Glu Asp Lys Gly Pro Ile Tyr Asn Tyr Arg Val Glu Ile Ser 
1155 1160 1165 

ATC TTC TTC ATC ATC TAC ATC ATC ATC ATC GCC TTC TTC ATG ATG AAC 3552 
Ile Phe Phe Ile Ile Tyr Ile Ile Ile Ile Ala Phe Phe Met Met Asn 
1170 1175 1180 

ATC TTC GTG GGC TTC GTC ATC GTC ACC TTT CAG GAG CAG GGG GAG CAG 3600 
Ile Phe Val Gly Phe Val Ile Val Thr Phe Gln Glu Gln Gly Glu Gln 
1185 1190 1195 1200 

GAG TAC AAG AAC TGT GAG CTG GAC AAG AAC CAG CGA CAG TGC GTG GAA 3648 
Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gln Arg Gln Cys Val Glu 
1205 1210 1215 

TAC GCC CTC AAG GCC CGG CCC CTG CGG AGG TAC ATC CCC AAG AAC CAG 3696 
Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr Ile Pro Lys Asn Gln 
1220 1225 1230 

CAC CAG TAC AAA GTG TGG TAC GTG GTC AAC TCC ACC TAC TTC GAG TAC 3744 
His Gln Tyr Lys Val Trp Tyr Val Val Asn Ser Thr Tyr Phe Glu Tyr 
1235 1240 1245 

CTG ATG TTC GTC CTC ATC CTG CTC AAC ACC ATC TGC CTG GCC ATG CAG 3792 
Leu Met Phe Val Leu Ile Leu Leu Asn Thr Ile Cys Leu Ala Met Gln 
1250 1255 1260 

CAC TAC GGC CAG AGC TGC CTG TTC AAA ATC GCC ATG AAC ATC CTC AAC 3840 
His Tyr Gly Gln Ser Cys Leu Phe Lys Ile Ala Met Asn Ile Leu Asn 
1265 1270 1275 1280 

ATG CTC TTC ACT GGC CTC TTC ACC GTG GAG ATG ATC CTG AAG CTC ATT 3888 
Met Leu Phe Thr Gly Leu Phe Thr Val Glu Met Ile Leu Lys Leu Ile 
1285 1290 1295 

GCC TTC AAA CCC AAG GGT TAC TTT AGT GAT CCC TGG AAT GTT TTT GAC 3936 
Ala Phe Lys Pro Lys Gly Tyr Phe Ser Asp Pro Trp Asn Val Phe Asp 
1300 1305 1310 

TTC CTC ATC GTA ATT GGC AGC ATA ATT GAC GTC ATT CTC AGT GAG ACT 3984 
Phe Leu Ile Val Ile Gly Ser Ile Ile Asp Val Ile Leu Ser Glu Thr 
1315 1320 1325 

AAT CCA GCT GAA CAT ACC CAA TGC TCT CCC TCT ATG AAC GCA GAG GAA 4032 
Asn Pro Ala Glu His Thr Gln Cys Ser Pro Ser Met Asn Ala Glu Glu 
1330 1335 1340 

AAC TCC CGC ATC TCC ATC ACC TTC TTC CGC CTG TTC CGG GTC ATG CGT 4080 
Asn Ser Arg Ile Ser Ile Thr Phe Phe Arg Leu Phe Arg Val Met Arg 
1345 1350 1355 1360 

CTG GTG AAG CTG CTG AGC CGT GGG GAG GGC ATC CGG ACG CTG CTG TGG 4128 
Leu Val Lys Leu Leu Ser Arg Gly Glu Gly Ile Arg Thr Leu Leu Trp 
1365 1370 1375 

ACC TTC ATC AAG TCC TTC CAG GCC CTG CCC TAT GTG GCC CTC CTG ATC 4176 
Thr Phe Ile Lys Ser Phe Gln Ala Leu Pro Tyr Val Ala Leu Leu Ile 
1380 1385 1390 

GTG ATG CTG TTC TTC ATC TAC GCG GTG ATC GGG ATG CAG GTG TTT GGG 4224 
Val Met Leu Phe Phe Ile Tyr Ala Val Ile Gly Met Gln Val Phe Gly 
1395 1400 1405 

AAA ATT GCC CTG AAT GAT ACC ACA GAG ATC AAC CGG AAC AAC AAC TTT 4272 
Lys Ile Ala Leu Asn Asp Thr Thr Glu Ile Asn Arg Asn Asn Asn Phe 
1410 1415 1420 

CAG ACC TTC CCC CAG GCC GTG CTG CTC CTC TTC AGG TGT GCC ACC GGG 4320 
Gln Thr Phe Pro Gln Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly 
1425 1430 1435 1440 

GAG GCC TGG CAG GAC ATC ATG CTG GCC TGC ATG CCA GGC AAG AAG TGT 4368 
Glu Ala Trp Gln Asp Ile Met Leu Ala Cys Met Pro Gly Lys Lys Cys 
1445 1450 1455 

GCC CCA GAG TCC GAG CCC AGC AAC AGC ACG GAG GGT GAA ACA CCC TGT 4416 
Ala Pro Glu Ser Glu Pro Ser Asn Ser Thr Glu Gly Glu Thr Pro Cys 
1460 1465 1470 

GGT AGC AGC TTT GCT GTC TTC TAC TTC ATC AGC TTC TAC ATG CTC TGT 4464 
Gly Ser Ser Phe Ala Val Phe Tyr Phe Ile Ser Phe Tyr Met Leu Cys 
1475 1480 1485 

GCC TTC CTG ATC ATC AAC CTC TTT GTA GCT GTC ATC ATG GAC AAC TTT 4512 
Ala Phe Leu Ile Ile Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe 
1490 1495 1500 

GAC TAC CTG ACA AGG GAC TGG TCC ATC CTT GGT CCC CAC CAC CTG GAT 4560 
Asp Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro His His Leu Asp 
1505 1510 1515 1520 

GAG TTT AAA AGA ATC TGG GCA GAG TAT GAC CCT GAA GCC AAG GGT CGT 4608 
Glu Phe Lys Arg Ile Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg 
1525 1530 1535 

ATC AAA CAC CTG GAT GTG GTG ACC CTC CTC CGG CGG ATT CAG CCG CCA 4656 
Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg Ile Gln Pro Pro 
1540 1545 1550 

rfl CTG TGC CCT °^ CGC GTG GCT TGC AAA CGC CTG 4704 
Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys Lys Arg Leu 
1555 1560 1565 

vl? c CC 5*? ^ C ATG CCT CTG ^ AGC GAC GGG ACA GTC ATG TTC AAT 4752 
ft™ Asn Met Pro Leu A* 311 Ser Asp Gly Thr Val Met Phe Asn 
1570 1575 1580 

GCC ACC CTG TTT GCC CTG GTC AGG ACG GCC CTG AGG ATC AAA ACA GAA 4800 
Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Arg Ile Lys Thr Glu 
1585 1590 1595 1600 

rit r CTA ^ G ?° GAG GAG CTG CGG GCG ATC ATC AAG AAG 4848 
Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu Arg Ala Ile Ile Lys Lys 
1605 1610 1615 

ATC TGG AAG CGG ACC AGC ATG AAG CTG CTG GAC CAG GTG GTG CCC CCT 4896 
Ile Trp Lys Arg Thr Ser Met Lys Leu Leu Asp Gln Val Val Pro Pro 
1620 1625 1630 

GCA GGT GAT GAT GAG GTC ACC GTT GGC AAG TTC TAC GCC ACG TTC CTG 4944 
Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr Ala Thr Phe Leu 
1635 1640 1645 

ATC CAG GAG TAC TTC CGG AAG TTC AAG AAG CGC AAA GAG CAG GGC CTT 4992 
Ile Gln Glu Tyr Phe Arg Lys Phe Lys Lys Arg Lys Glu Gln Gly Leu 
1650 1655 1660 

GTG GGC AAG CCC TCC CAG AGG AAC GCG CTG TCT CTG CAG GCT GGC TTG 5040 
Val Gly Lys Pro Ser Gln Arg Asn Ala Leu Ser Leu Gln Ala Gly Leu 
1665 1670 1675 1680 

CGC ACA CTG CAT GAC ATC GGG CCT GAG ATC CGA CGG GCC ATC TCT GGA 5088 
Arg Thr Leu His Asp Ile Gly Pro Glu Ile Arg Arg Ala Ile Ser Gly 
1685 1690 1695 

GAT CTC ACC GCT GAG GAG GAG CTG GAC AAG GCC ATG AAG GAG GCT GTG 5136 
Asp Leu Thr Ala Glu Glu Glu Leu Asp Lys Ala Met Lys Glu Ala Val 
1700 1705 1710 

TCC GCT GCT TCT GAA GAT GAC ATC TTC AGG AGG GCC GGT GGC CTG TTC 5184 
Ser Ala Ala Ser Glu Asp Asp Ile Phe Arg Arg Ala Gly Gly Leu Phe 
1715 1720 1725 

GGC AAC CAC GTC AGC TAC TAC CAA AGC GAC GGC CGG AGC GCC TTC CCC 5232 
Gly Asn His Val Ser Tyr Tyr Gln Ser Asp Gly Arg Ser Ala Phe Pro 
1730 1735 1740 

CAG ACC TTC ACC ACT CAG CGC CCG CTG CAC ATC AAC AAG GCG GGC AGC 5280 
Gln Thr Phe Thr Thr Gln Arg Pro Leu His Ile Asn Lys Ala Gly Ser 
1745 1750 1755 1760 

AGC CAG GGC GAC ACT GAG TCG CCA TCC CAC GAG AAG CTG GTG GAC TCC 5328 
Ser Gln Gly Asp Thr Glu Ser Pro Ser His Glu Lys Leu Val Asp Ser 
1765 1770 1775 

ACC TTC ACC CCG AGC AGC TAC TCG TCC ACC GGC TCC AAC GCC AAC ATC 5376 
Thr Phe Thr Pro Ser Ser Tyr Ser Ser Thr Gly Ser Asn Ala Asn Ile 
1780 1785 1790 

AAC AAC GCC AAC AAC ACC GCC CTG GGT CGC CTC CCT CGC CCC GCC GGC 5424 
Asn Asn Ala Asn Asn Thr Ala Leu Gly Arg Leu Pro Arg Pro Ala Gly 
1795 1800 1805 

TAC CCC AGC ACA GTC AGC ACT GTG GAG GGC CAC GGG CCC CCC TTG TCC 5472 
Tyr Pro Ser Thr Val Ser Thr Val Glu Gly His Gly Pro Pro Leu Ser 
1810 1815 1820 

CCT GCC ATC CGG GTG CAG GAG GTG GCG TGG AAG CTC AGC TCC AAC AGG 5520 
Pro Ala Ile Arg Val Gln Glu Val Ala Trp Lys Leu Ser Ser Asn Arg 
1825 1830 1835 1840 

TGC CAC TCC CGG GAG AGC CAG GCA GCC ATG GCG CGT CAG GAG GAG ACG 5568 
Cys His Ser Arg Glu Ser Gln Ala Ala Met Ala Arg Gln Glu Glu Thr 
1845 1850 1855 

TCT CAG GAT GAG ACC TAT GAA GTG AAG ATG AAC CAT GAC ACG GAG GCC 5616 
Ser Gln Asp Glu Thr Tyr Glu Val Lys Met Asn His Asp Thr Glu Ala 
1860 1865 1870 

TGC AGT GAG CCC AGC CTG CTC TCC ACA GAG ATG CTC TCC TAC CAG GAT 5664 
Cys Ser Glu Pro Ser Leu Leu Ser Thr Glu Met Leu Ser Tyr Gln Asp 
1875 1880 1885 

GAC GAA AAT CGG CAA CTG ACG CTC CCA GAG GAG GAC AAG AGG GAC ATC 5712 
Asp Glu Asn Arg Gln Leu Thr Leu Pro Glu Glu Asp Lys Arg Asp Ile 
1890 1895 1900 

CGG CAA TCT CCG AAG AGG GGT TTC CTC CGC TCT GCC TCA CTA GGT CGA 5760 
Arg Gln Ser Pro Lys Arg Gly Phe Leu Arg Ser Ala Ser Leu Gly Arg 
1905 1910 1915 1920 

AGG GCC TCC TTC CAC CTG GAA TGT CTG AAG CGA CAG AAG GAC CGA GGG 5808 
Arg Ala Ser Phe His Leu Glu Cys Leu Lys Arg Gln Lys Asp Arg Gly 
1925 1930 1935 

GGA GAC ATC TCT CAG AAG ACA GTC CTG CCC TTG CAT CTG GTT CAT CAT 5856 
Gly Asp Ile Ser Gln Lys Thr Val Leu Pro Leu His Leu Val His His 
1940 1945 1950 

CAG GCA TTG GCA GTG GCA GGC CTG AGC CCC CTC CTC CAG AGA AGC CAT 5904 
Gln Ala Leu Ala Val Ala Gly Leu Ser Pro Leu Leu Gln Arg Ser His 
1955 1960 1965 

TCC CCT GCC TCA TTC CCT AGG CCT TTT GCC ACC CCA CCA GCC ACA CCT 5952 
Ser Pro Ala Ser Phe Pro Arg Pro Phe Ala Thr Pro Pro Ala Thr Pro 
1970 1975 1980 

GGC AGC CGA GGC TGG CCC CCA CAG CCC GTC CCC ACC CTG CGG CTT GAG 6000 
Gly Ser Arg Gly Trp Pro Pro Gln Pro Val Pro Thr Leu Arg Leu Glu 
1985 1990 1995 2000 

GGG GTC GAG TCC AGT GAG AAA CTC AAC AGC AGC TTC CCA TCC ATC CAC 6048 
Gly Val Glu Ser Ser Glu Lys Leu Asn Ser Ser Phe Pro Ser Ile His 
2005 2010 2015 

TGC GGC TCC TGG GCT GAG ACC ACC CCC GGT GGC GGG GGC AGC AGC GCC 6096 
Cys Gly Ser Trp Ala Glu Thr Thr Pro Gly Gly Gly Gly Ser Ser Ala 
2020 2025 2030 

GCC CGG AGA GTC CGG CCC GTC TCC CTC ATG GTG CCC AGC CAG GCT GGG 6144 
Ala Arg Arg Val Arg Pro Val Ser Leu Met Val Pro Ser Gln Ala Gly 
2035 2040 2045 

GCC CCA GGG AGG CAG TTC CAC GGC AGT GCC AGC AGC CTG GTG GAA GCG 6192 
Ala Pro Gly Arg Gln Phe His Gly Ser Ala Ser Ser Leu Val Glu Ala 
2050 2055 2060 

GTC TTG ATT TCA GAA GGA CTG GGG CAG TTT GCT CAA GAT CCC AAG TTC 6240 
Val Leu Ile Ser Glu Gly Leu Gly Gln Phe Ala Gln Asp Pro Lys Phe 
2070 2075 2080 2085 

ATC GAG GTC ACC ACC CAG GAG CTG GCC GAC GCC TGC GAC ATG ACC ATA 6288 
Ile Glu Val Thr Thr Gln Glu Leu Ala Asp Ala Cys Asp Met Thr Ile 
2090 2095 2100 

GAG GAG ATG GAG AGC GCG GCC GAC AAC ATC CTC AGC GGG GGC GCC CCA 6336 
Glu Glu Met Glu Ser Ala Ala Asp Asn Ile Leu Ser Gly Gly Ala Pro 
2105 2110 2115 

CAG AGC CCC AAT GGC GCC CTC TTA CCC TTT GTG AAC TGC AGG GAC GCG 6384 
Gln Ser Pro Asn Gly Ala Leu Leu Pro Phe Val Asn Cys Arg Asp Ala 
2120 2125 2130 

GGG CAG GAC CGA GCC GGG GGC GAA GAG GAC GCG GGC TGT GTG CGC GCG 6432 
Gly Gln Asp Arg Ala Gly Gly Glu Glu Asp Ala Gly Cys Val Arg Ala 
2135 2135 2140 

CGG GGT CGA CCG AGT GAG GAG GAG CTC CAG GAC AGC AGG GTC TAC GTC 6480 
Arg Gly Arg Pro Ser Glu Glu Glu Leu Gln Asp Ser Arg Val Tyr Val 
2145 2150 2155 2160 

AGC AGC CTG TAGTGGGCGC TGCCAGATGC GGGCTTTTTT TTATTTGTTT CAATGTTCCT 6539 
Ser Ser Leu 

AATGGGTTCG TTTCAGAAGT GCCTCACTGT TCTCGT 6575 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
AGACCACGGC TTCCTCGAAT CTTGCGCGAA GCCGCCGGCC ATCGGAGGAG GGATTAATCC 60 
AGACCCGCCG GGGGGTGTTT TCACATTTCT TCCTCTTCGT GGCTGCTCCT CCTATTAAAA 120 
CCATTTTTGG TCC 133 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGCTGAGGGC CTTCCGCGTG CTGCGCCCCC TGCGGCTGGT GTCCGGAGTC CCAAGTCTCC 60 
AGGTGGTCCT GAATTCCATC ATCAAGGCC 89 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /note= "An alternative exon of 
alpha-1C." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CAC TAT TTC TGT GAT GCA TGG AAT ACA TTT GAC GCC TTG ATT GTT GTG 48 
His Tyr Phe Cys Asp Ala Trp Asn Thr Phe Asp Ala Leu Ile Val Val 
1 5 10 15 

GGT AGC ATT GTT GAT ATA GCA ATC ACC GAG GTA AAC 84 
Gly Ser Ile Val Asp Ile Ala Ile Thr Glu Val Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7362 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 144..7163 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..143 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 7161..7362 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCGGCGGCGG CTGCGGCGGT GGGGCCGGGC GAGGTCCGTG CGGTCCCGGC GGCTCCGTGG 60 

CTGCTCCGCT CTGAGCGCCT GCGCGCCCCG CGCCCTCCCT GCCGGGGCCG CTGGGCCGGG 120 

GATGCACGCG GGGCCCGGGA GCC ATG GTC CGC TTC GGG GAC GAG CTG GGC 170 

Met Val Arg Phe Gly Asp Glu Leu Gly 
1 5 

GGC CGC TAT GGA GGC CCC GGC GGC GGA GAG CGG GCC CGG GGC GGC GGG 218 
Gly Arg Tyr Gly Gly Pro Gly Gly Gly Glu Arg Ala Arg Gly Gly Gly 
10 15 20 25 

GCC GGC GGG GCG GGG GGC CCG GGT CCC GGG GGG CTG CAG CCC GGC CAG 266 
Ala Gly Gly Ala Gly Gly Pro Gly Pro Gly Gly Leu Gln Pro Gly Gln 
30 35 40 

CGG GTC CTC TAC AAG CAA TCG ATC GCG CAG CGC GCG CGG ACC ATG GCG 314 
Arg Val Leu Tyr Lys Gln Ser Ile Ala Gln Arg Ala Arg Thr Met Ala 
45 50 55 

CTG TAC AAC CCC ATC CCG GTC AAG CAG AAC TGC TTC ACC GTC AAC CGC 362 
Leu Tyr Asn Pro Ile Pro Val Lys Gln Asn Cys Phe Thr Val Asn Arg 
60 65 70 

TCG CTC TTC GTC TTC AGC GAG GAC AAC GTC GTC CGC AAA TAC GCG AAG 410 
Ser Leu Phe Val Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys 
75 80 85 

CGC ATC ACC GAG TGG CCT CCA TTC GAG AAT ATG ATC CTG GCC ACC ATC 458 
Arg Ile Thr Glu Trp Pro Pro Phe Glu Asn Met Ile Leu Ala Thr Ile 
90 95 100 105 

ATC GCC AAC TGC ATC GTG CTG GCC CTG GAG CAG CAC CTC CCT GAT GGG 506 
Ile Ala Asn Cys Ile Val Leu Ala Leu Glu Gln His Leu Pro Asp Gly 
110 115 120 

GAC AAA ACG CCC ATG TCC GAG CGG CTG GAC GAC ACG GAG CCC TAT TTC 554 
Asp Lys Thr Pro Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe 
125 130 135 

ATC GGG ATC TTT TGC TTC GAG GCA GGG ATC AAA ATC ATC GCT CTG GGC 602 
Ile Gly Ile Phe Cys Phe Glu Ala Gly Ile Lys Ile Ile Ala Leu Gly 
140 145 150 

TTT GTC TTC CAC AAG GGC TCT TAC CTG CGG AAC GGC TGG AAC GTC ATG 650 
Phe Val Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met 
155 160 165 

GAC TTC GTG GTC GTC CTC ACA GGG ATC CTT GCC ACG GCT GGA ACT GAC 698 
Asp Phe Val Val Val Leu Thr Gly Ile Leu Ala Thr Ala Gly Thr Asp 
170 175 180 185 

TTC GAC CTG CGA ACA CTG AGG GCT GTG CGT GTG CTG AGG CCC CTG AAG 746 
Phe Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys 
190 195 200 

CTG GTG TCT GGG ATT CCA AGT TTG CAG GTG GTG CTC AAG TCC ATC ATG 794 
Leu Val Ser Gly Ile Pro Ser Leu Gln Val Val Leu Lys Ser Ile Met 
205 210 215 

AAG GCC ATG GTT CCA CTC CTG CAG ATT GGG CTG CTT CTC TTC TTT GCC 842 
Lys Ala Met Val Pro Leu Leu Gln Ile Gly Leu Leu Leu Phe Phe Ala 
220 225 230 

ATC CTC ATG TTT GCC ATC ATT GGC CTG GAG TTC TAC ATG GGC AAG TTC 890 
Ile Leu Met Phe Ala Ile Ile Gly Leu Glu Phe Tyr Met Gly Lys Phe 
235 240 245 

CAC AAG GCC TGT TTC CCC AAC AGC ACA GAT GCG GAG CCC GTG GGT GAC 938 
His Lys Ala Cys Phe Pro Asn Ser Thr Asp Ala Glu Pro Val Gly Asp 
250 255 260 265 

TTC CCC TGT GGC AAG GAG GCC CCA GCC CGG CTG TGC GAG GGC GAC ACT 986 
Phe Pro Cys Gly Lys Glu Ala Pro Ala Arg Leu Cys Glu Gly Asp Thr 
270 275 280 

GAG TGC CGG GAG TAC TGG CCA GGA CCC AAC TTT GGC ATC ACC AAC TTT 1034 
Glu Cys Arg Glu Tyr Trp Pro Gly Pro Asn Phe Gly Ile Thr Asn Phe 
285 290 295 

GAC AAT ATC CTG TTT GCC ATC TTG ACG GTG TTC CAG TGC ATC ACC ATG 1082 
Asp Asn Ile Leu Phe Ala Ile Leu Thr Val Phe Gln Cys Ile Thr Met 
300 305 310 

GAG GGC TGG ACT GAC ATC CTC TAT AAT ACA AAC GAT GCG GCC GGC AAC 1130 
Glu Gly Trp Thr Asp Ile Leu Tyr Asn Thr Asn Asp Ala Ala Gly Asn 
315 320 325 

ACC TGG AAC TGG CTC TAC TTC ATC CCT CTC ATC ATC ATC GGC TCC TTC 1178 
Thr Trp Asn Trp Leu Tyr Phe Ile Pro Leu Ile Ile Ile Gly Ser Phe 
330 335 340 345 

TTC ATG CTC AAC CTG GTG CTG GGC GTG CTC TCG GGG GAG TTT GCC AAG 1226 
Phe Met Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys 
350 355 360 

GAG CGA GAG AGG GTG GAG AAC CGC CGC GCC TTC CTG AAG CTG CGC CGG 1274 
Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg 
365 370 375 

CAG CAG CAG ATC GAG CGA GAG CTC AAC GGG TAC CTG GAG TGG ATC TTC 1322 
Gln Gln Gln Ile Glu Arg Glu Leu Asn Gly Tyr Leu Glu Trp Ile Phe 
380 385 390 

AAG GCG GAG GAA GTC ATG CTG GCC GAG GAG GAC AGG AAT GCA GAG GAG 1370 
Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asp Arg Asn Ala Glu Glu 
395 400 405 

AAG TCC CCT TTG GAC GTG CTG AAG AGA GCG GCC ACC AAG AAG AGC AGA 1418 
Lys Ser Pro Leu Asp Val Leu Lys Arg Ala Ala Thr Lys Lys Ser Arg 
410 415 420 425 

AAT GAC CTG ATC CAC GCA GAG GAG GGA GAG GAC CGG TTT GCA GAT CTC 1466 
Asn Asp Leu Ile His Ala Glu Glu Gly Glu Asp Arg Phe Ala Asp Leu 
430 435 440 

TGT GCT GTT GGA TCC CCC TTC GCC CGC GCC AGC CTC AAG AGC GGG AAG 1514 
Cys Ala Val Gly Ser Pro Phe Ala Arg Ala Ser Leu Lys Ser Gly Lys 
445 450 455 

ACA GAG AGC TCG TCA TAC TTC CGG AGG AAG GAG AAG ATG TTC CGG TTT 1562 
Thr Glu Ser Ser Ser Tyr Phe Arg Arg Lys Glu Lys Met Phe Arg Phe 
460 465 470 

TTT ATC CGG CGC ATG GTG AAG GCT CAG AGC TTC TAC TGG GTG GTG CTG 1610 
Phe Ile Arg Arg Met Val Lys Ala Gln Ser Phe Tyr Trp Val Val Leu 
475 480 485 

TGC GTG GTG GCC CTG AAC ACA CTG TGT GTG GCC ATG GTG CAT TAC AAC 1658 
Cys Val Val Ala Leu Asn Thr Leu Cys Val Ala Met Val His Tyr Asn 
490 495 500 505 

CAG CCG CGG CGG CTT ACC ACG ACC CTG TAT TTT GCA GAG TTT GTT TTC 1706 
Gln Pro Arg Arg Leu Thr Thr Thr Leu Tyr Phe Ala Glu Phe Val Phe 
510 515 520 

CTG GGT CTC TTC CTC ACA GAG ATG TCC CTG AAG ATG TAT GGC CTG GGG 1754 
Leu Gly Leu Phe Leu Thr Glu Met Ser Leu Lys Met Tyr Gly Leu Gly 
525 530 535 

CCC AGA AGC TAC TTC CGG TCC TCC TTC AAC TGC TTC GAC TTT GGG GTC 1802 
Pro Arg Ser Tyr Phe Arg Ser Ser Phe Asn Cys Phe Asp Phe Gly Val 
540 545 550 

ATC GTG GGG AGC GTC TTT GAA GTG GTC TGG GCG GCC ATC AAG CCG GGA 1850 
Ile Val Gly Ser Val Phe Glu Val Val Trp Ala Ala Ile Lys Pro Gly 
555 560 565 

AGC TCC TTT GGG ATC AGT GTG CTG CGG GCC CTC CGC CTG CTG AGG ATC 1898 
Ser Ser Phe Gly Ile Ser Val Leu Arg Ala Leu Arg Leu Leu Arg Ile 
570 575 580 585 

TTC AAA GTC ACG AAG TAC TGG AGC TCC CTG CGG AAC CTG GTG GTG TCC 1946 
Phe Lys Val Thr Lys Tyr Trp Ser Ser Leu Arg Asn Leu Val Val Ser 
590 595 600 

CTG CTG AAC TCC ATG AAG TCC ATC ATC AGC CTG CTC TTC TTG CTC TTC 1994 
Leu Leu Asn Ser Met Lys Ser Ile Ile Ser Leu Leu Phe Leu Leu Phe
605 610 615

CTG TTC ATT GTG GTC TTC GCC CTG CTG GGG ATG CAG CTG TTT GGG GGA 2042 
Leu Phe Ile Val Val Phe Ala Leu Leu Gly Met Gln Leu Phe Gly Gly
620 625 630 

CAG TTC AAC TTC CAG GAT GAG ACT CCC ACA ACC AAC TTC GAC ACC TTC 2090 
Gln Phe Asn Phe Gln Asp Glu Thr Pro Thr Thr Asn Phe Asp Thr Phe
635 640 645 

CCT GCC GCC ATC CTC ACT GTC TTC CAG ATC CTG ACG GGA GAG GAC TGG 2138 
Pro Ala Ala Ile Leu Thr Val Phe Gln Ile Leu Thr Gly Glu Asp Trp
650 655 660 665 

AAT GCA GTG ATG TAT CAC GGG ATC GAA TCG CAA GGC GGC GTC AGC AAA 2186 
Asn Ala Val Met Tyr His Gly Ile Glu Ser Gln Gly Gly Val Ser Lys
670 675 680 

GGC ATG TTC TCG TCC TTT TAC TTC ATT GTC CTG ACA CTG TTC GGA AAC 2234 
Gly Met Phe Ser Ser Phe Tyr Phe Ile Val Leu Thr Leu Phe Gly Asn
685 690 695

TAC ACT CTG CTG AAT GTC TTT CTG GCC ATC GCT GTG GAC AAC CTG GCC 2282 
Tyr Thr Leu Leu Asn Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala
700 705 710 

AAC GCC CAA GAG CTG ACC AAG GAT GAA GAG GAG ATG GAA GAA GCA GCC 2330 
Asn Ala Gln Glu Leu Thr Lys Asp Glu Glu Glu Met Glu Glu Ala Ala
715 720 725 

AAT CAG AAG CTT GCT CTG CAA AAG GCC AAA GAA GTG GCT GAA GTC AGC 2378 
Asn Gln Lys Leu Ala Leu Gln Lys Ala Lys Glu Val Ala Glu Val Ser
730 735 740 745

CCC ATG TCT GCC GCG AAC ATC TCC ATC GCC GCC AGG CAG CAG AAC TCG 2426
Pro Met Ser Ala Ala Asn Ile Ser Ile Ala Ala Arg Gln Gln Asn Ser
750 755 760 

GCC AAG GCG CGC TCG GTG TGG GAG CAG CGG GCC AGC CAG CTA CGG CTG 2474
Ala Lys Ala Arg Ser Val Trp Glu Gln Arg Ala Ser Gln Leu Arg Leu
765 770 775 

CAG AAC CTG CGG GCC AGC TGC GAG GCG CTG TAC AGC GAG ATG GAC CCC 2522 
Gln Asn Leu Arg Ala Ser Cys Glu Ala Leu Tyr Ser Glu Met Asp Pro
780 785 790 

GAG GAG CGG CTG CGC TTC GCC ACT ACG CGC CAC CTG CGG CCC GAC ATG 2570 
Glu Glu Arg Leu Arg Phe Ala Thr Thr Arg His Leu Arg Pro Asp Met 
795 800 805

AAG ACG CAC CTG GAC CGG CCG CTG GTG GTG GAG CTG GGC CGC GAC GGC 2618 
Lys Thr His Leu Asp Arg Pro Leu Val Val Glu Leu Gly Arg Asp Gly 
810 815 820 825

GCG CGG GGG CCC GTG GGA GGC AAA GCC CGA CCT GAG GCT GCG GAG GCC 2666 
Ala Arg Gly Pro Val Gly Gly Lys Ala Arg Pro Glu Ala Ala Glu Ala 
830 835 840 

CCC GAG GGC GTC GAC CCT CCG CGC AGG CAC CAC CGG CAC CGC GAC AAG 2714
Pro Glu Gly Val Asp Pro Pro Arg Arg His His Arg His Arg Asp Lys 
845 850 855 

GAC AAG ACC CCC GCG GCG GGG GAC CAG GAC CGA GCA GAG GCC CCG AAG 2762 
Asp Lys Thr Pro Ala Ala Gly Asp Gln Asp Arg Ala Glu Ala Pro Lys
860 865 870

GCG GAG AGC GGG GAG CCC GGT GCC CGG GAG GAG CGG CCG CGG CCG CAC 2810 
Ala Glu Ser Gly Glu Pro Gly Ala Arg Glu Glu Arg Pro Arg Pro His 
875 880 885 

CGC AGC CAC AGC AAG GAG GCC GCG GGG CCC CCG GAG GCG CGG AGC GAG 2858 
Arg Ser His Ser Lys Glu Ala Ala Gly Pro Pro Glu Ala Arg Ser Glu 
890 895 900 905

CGC GGC CGA GGC CCA GGC CCC GAG GGC GGC CGG CGG CAC CAC CGG CGC 2906 
Arg Gly Arg Gly Pro Gly Pro Glu Gly Gly Arg Arg His His Arg Arg 
910 915 920 

GGC TCC CCG GAG GAG GCG GCC GAG CGG GAG CCC CGA CGC CAC CGC GCG 2954 
Gly Ser Pro Glu Glu Ala Ala Glu Arg Glu Pro Arg Arg His Arg Ala 
925 930 935 

CAC CGG CAC CAG GAT CCG AGC AAG GAG TGC GCC GGC GCC AAG GGC GAG 3002 
His Arg His Gln Asp Pro Ser Lys Glu Cys Ala Gly Ala Lys Gly Glu
940 945 950 

CGG CGC GCG CGG CAC CGC GGC GGC CCC CGA GCG GGG CCC CGG GAG GCG 3050
Arg Arg Ala Arg His Arg Gly Gly Pro Arg Ala Gly Pro Arg Glu Ala 
955 960 965

GAG AGC GGG GAG GAG CCG GCG CGG CGG CAC CGG GCC CGG CAC AAG GCG 3098
Glu Ser Gly Glu Glu Pro Ala Arg Arg His Arg Ala Arg His Lys Ala 
970 975 980 985 

CAG CCT GCT CAC GAG GCT GTG GAG AAG GAG ACC ACG GAG AAG GAG GCC 3146 
Gln Pro Ala His Glu Ala Val Glu Lys Glu Thr Thr Glu Lys Glu Ala
990 995 1000 

ACG GAG AAG GAG GCT GAG ATA GTG GAA GCC GAC AAG GAA AAG GAG CTC 3194 
Thr Glu Lys Glu Ala Glu Ile Val Glu Ala Asp Lys Glu Lys Glu Leu
1005 1010 1015

CGG AAC CAC CAG CCC CGG GAG CCA CAC TGT GAC CTG GAG ACC AGT GGG 3242 
Arg Asn His Gln Pro Arg Glu Pro His Cys Asp Leu Glu Thr Ser Gly
1020 1025 1030 

ACT GTG ACT GTG GGT CCC ATG CAC ACA CTG CCC AGC ACC TGT CTC CAG 3290
Thr Val Thr Val Gly Pro Met His Thr Leu Pro Ser Thr Cys Leu Gln
1035 1040 1045 

AAG GTG GAG GAA CAG CCA GAG GAT GCA GAC AAT CAG CGG AAC GTC ACT 3338 
Lys Val Glu Glu Gln Pro Glu Asp Ala Asp Asn Gln Arg Asn Val Thr
1050 1055 1060 1065 

CGC ATG GGC AGT CAG CCC CCA GAC CCG AAC ACT ATT GTA CAT ATC CCA 3386 
Arg Met Gly Ser Gln Pro Pro Asp Pro Asn Thr Ile Val His Ile Pro
1070 1075 1080 

GTG ATG CTG ACG GGC CCT CTT GGG GAA GCC ACG GTC GTT CCC AGT GGT 3434 
Val Met Leu Thr Gly Pro Leu Gly Glu Ala Thr Val Val Pro Ser Gly 
1085 1090 1095 

AAC GTG GAC CTG GAA AGC CAA GCA GAG GGG AAG AAG GAG GTG GAA GCG 3482
Asn Val Asp Leu Glu Ser Gln Ala Glu Gly Lys Lys Glu Val Glu Ala
1100 1105 1110 

GAT GAC GTG ATG AGG AGC GGC CCC CGG CCT ATC GTC CCA TAC AGC TCC 3530
Asp Asp Val Met Arg Ser Gly Pro Arg Pro Ile Val Pro Tyr Ser Ser
1115 1120 1125 

ATG TTC TGT TTA AGC CCC ACC AAC CTG CTC CGC CGC TTC TGC CAC TAC 3578
Met Phe Cys Leu Ser Pro Thr Asn Leu Leu Arg Arg Phe Cys His Tyr 
1130 1135 1140 1145 

ATC GTG ACC ATG AGG TAC TTC GAG GTG GTC ATT CTC GTG GTC ATC GCC 3626 
Ile Val Thr Met Arg Tyr Phe Glu Val Val Ile Leu Val Val Ile Ala
1150 1155 1160 

TTG AGC AGC ATC GCC CTG GCT GCT GAG GAC CCA GTG CGC ACA GAC TCG 3674 
Leu Ser Ser Ile Ala Leu Ala Ala Glu Asp Pro Val Arg Thr Asp Ser
1165 1170 1175 

CCC AGG AAC AAC GCT CTG AAA TAC CTG GAT TAC ATT TTC ACT GGT GTC 3722
Pro Arg Asn Asn Ala Leu Lys Tyr Leu Asp Tyr Ile Phe Thr Gly Val
1180 1185 1190

TTT ACC TTT GAG ATG GTG ATA AAG ATG ATC GAC TTG GGA CTG CTG CTT 3770
Phe Thr Phe Glu Met Val Ile Lys Met Ile Asp Leu Gly Leu Leu Leu
1195 1200 1205 

CAC CCT GGA GCC TAT TTC CGG GAC TTG TGG AAC ATT CTG GAC TTC ATT 3818 
His Pro Gly Ala Tyr Phe Arg Asp Leu Trp Asn Ile Leu Asp Phe Ile
1210 1215 1220 1225 

GTG GTC AGT GGC GCC CTG GTG GCG TTT GCT TTC TCA GGA TCC AAA GGG 3866 
Val Val Ser Gly Ala Leu Val Ala Phe Ala Phe Ser Gly Ser Lys Gly 
1230 1235 1240

AAA GAC ATC AAT ACC ATC AAG TCT CTG AGA GTC CTT CGT GTC CTG CGG 3914 
Lys Asp Ile Asn Thr Ile Lys Ser Leu Arg Val Leu Arg Val Leu Arg
1245 1250 1255

CCC CTC AAG ACC ATC AAA CGG CTG CCC AAG CTC AAG GCT GTG TTT GAC 3962 
Pro Leu Lys Thr Ile Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp
1260 1265 1270 

TGT GTG GTG AAC TCC CTG AAG AAT GTC CTC AAC ATC TTG ATT GTC TAC 4010 
Cys Val Val Asn Ser Leu Lys Asn Val Leu Asn Ile Leu Ile Val Tyr
1275 1280 1285 

ATG CTC TTC ATG TTC ATA TTT GCC GTC ATT GCG GTG CAG CTC TTC AAA 4058
Met Leu Phe Met Phe Ile Phe Ala Val Ile Ala Val Gln Leu Phe Lys
1290 1295 1300 1305 

GGG AAG TTT TTC TAC TGC ACA GAT GAA TCC AAG GAG CTG GAG AGG GAC 4106 
Gly Lys Phe Phe Tyr Cys Thr Asp Glu Ser Lys Glu Leu Glu Arg Asp 
1310 1315 1320 

TGC AGG GGT CAG TAT TTG GAT TAT GAG AAG GAG GAA GTG GAA GCT CAG 4154 
Cys Arg Gly Gln Tyr Leu Asp Tyr Glu Lys Glu Glu Val Glu Ala Gln
1325 1330 1335 

CCC AGG CAG TGG AAG AAA TAC GAC TTT CAC TAC GAC AAT GTG CTC TGG 4202 
Pro Arg Gln Trp Lys Lys Tyr Asp Phe His Tyr Asp Asn Val Leu Trp
1340 1345 1350 

GCT CTG CTG ACG CTG TTC ACA GTG TCC ACG GGA GAA GGC TGG CCC ATG 4250
Ala Leu Leu Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Met 
1355 1360 1365 

GTG CTG AAA CAC TCC GTG GAT GCC ACC TAT GAG GAG CAG GGT CCA AGC 4298
Val Leu Lys His Ser Val Asp Ala Thr Tyr Glu Glu Gln Gly Pro Ser
1370 1375 1380 1385 

CCT GGG TAC CGC ATG GAG CTG TCC ATC TTC TAC GTG GTC TAC TTT GTG 4346 
Pro Gly Tyr Arg Met Glu Leu Ser Ile Phe Tyr Val Val Tyr Phe Val
1390 1395 1400 

GTC TTT CCC TTC TTC TTC GTC AAC ATC TTT GTG GCT TTG ATC ATC ATC 4394
Val Phe Pro Phe Phe Phe Val Asn Ile Phe Val Ala Leu Ile Ile Ile
1405 1410 1415

ACC TTC CAG GAG CAG GGG GAC AAG GTG ATG TCT GAA TGC AGC CTG GAG 4442
Thr Phe Gln Glu Gln Gly Asp Lys Val Met Ser Glu Cys Ser Leu Glu
1420 1425 1430 

AAG AAC GAG AGG GCT TGC ATT GAC TTC GCC ATC AGC GCC AAA CCC CTG 4490 
Lys Asn Glu Arg Ala Cys Ile Asp Phe Ala Ile Ser Ala Lys Pro Leu
1435 1440 1445 

ACA CGG TAC ATG CCC CAA AAC CGG CAG TCG TTC CAG TAT AAG ACG TGG 4538 
Thr Arg Tyr Met Pro Gln Asn Arg Gln Ser Phe Gln Tyr Lys Thr Trp
1450 1455 1460 1465 

ACA TTT GTG GTC TCC CCG CCC TTT GAA TAC TTC ATC ATG GCC ATG ATA 4586
Thr Phe Val Val Ser Pro Pro Phe Glu Tyr Phe Ile Met Ala Met Ile
1470 1475 1480 

GCC CTC AAC ACT GTG GTG CTG ATG ATG AAG TTC TAT GAT GCA CCC TAT 4634 
Ala Leu Asn Thr Val Val Leu Met Met Lys Phe Tyr Asp Ala Pro Tyr 
1485 1490 1495 

GAG TAC GAG CTG ATG CTG AAA TGC CTG AAC ATC GTG TTC ACA TCC ATG 4682 
Glu Tyr Glu Leu Met Leu Lys Cys Leu Asn Ile Val Phe Thr Ser Met
1500 1505 1510 

TTC TCC ATG GAA TGC GTG CTG AAG ATC ATC GCC TTT GGG GTG CTG AAC 4730
Phe Ser Met Glu Cys Val Leu Lys Ile Ile Ala Phe Gly Val Leu Asn
1515 1520 1525 

TAT TTC AGA GAT GCC TGG AAT GTC TTT GAC TTT GTC ACT GTG TTG GGA 4778
Tyr Phe Arg Asp Ala Trp Asn Val Phe Asp Phe Val Thr Val Leu Gly 
1530 1535 1540 1545 

AGT ATT ACT GAT ATT TTA GTA ACA GAG ATT GCG GAA ACG AAC AAT TTC 4826
Ser Ile Thr Asp Ile Leu Val Thr Glu Ile Ala Glu Thr Asn Asn Phe
1550 1555 1560 

ATC AAC CTC AGC TTC CTC CGC CTC TTT CGA GCT GCG CGG CTG ATC AAG 4874 
Ile Asn Leu Ser Phe Leu Arg Leu Phe Arg Ala Ala Arg Leu Ile Lys
1565 1570 1575 

CTG CTC CGC CAG GGC TAC ACC ATC CGC ATC CTG CTG TGG ACC TTT GTC 4922
Leu Leu Arg Gln Gly Tyr Thr Ile Arg Ile Leu Leu Trp Thr Phe Val
1580 1585 1590 

CAG TCC TTC AAG GCC CTG CCC TAC GTG TGT CTG CTC ATT GCC ATG CTG 4970
Gln Ser Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu Ile Ala Met Leu
1595 1600 1605

TTC TTC ATC TAC GCC ATC ATC GGC ATG CAG GTG TTT GGG AAT ATT GCC 5018 
Phe Phe Ile Tyr Ala Ile Ile Gly Met Gln Val Phe Gly Asn Ile Ala
1610 1615 1620 1625 

CTG GAT GAT GAC ACC AGC ATC AAC CGC CAC AAC AAC TTC CGG ACG TTT 5066 
Leu Asp Asp Asp Thr Ser Ile Asn Arg His Asn Asn Phe Arg Thr Phe
1630 1635 1640

TTG CAA GCC CTG ATG CTG CTG TTC AGG AGC GCC ACG GGG GAG GCC TGG 5114
Leu Gln Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp
1645 1650 1655

CAC GAG ATC ATG CTG TCC TGC CTG AGC AAC CAG GCC TGT GAT GAG CAG 5162
His Glu Ile Met Leu Ser Cys Leu Ser Asn Gln Ala Cys Asp Glu Gln
1660 1665 1670

GCC AAT GCC ACC GAG TGT GGA AGT GAC TTT GCC TAC TTC TAC TTC GTC 5210
Ala Asn Ala Thr Glu Cys Gly Ser Asp Phe Ala Tyr Phe Tyr Phe Val
1675 1680 1685

TCC TTC ATC TTC CTG TGC TCC TTT CTG ATG TTG AAC CTC TTT GTG GCT 5258
Ser Phe Ile Phe Leu Cys Ser Phe Leu Met Leu Asn Leu Phe Val Ala
1690 1695 1700 1705

GTG ATC ATG GAC AAT TTT GAG TAC CTC ACG CGG GAC TCT TCC ATC CTA 5306
Val Ile Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser Ile Leu
1710 1715 1720

GGT CCT CAC CAC TTG GAT GAG TTC ATC CGG GTC TGG GCT GAA TAC GAC 5354
Gly Pro His His Leu Asp Glu Phe Ile Arg Val Trp Ala Glu Tyr Asp
1725 1730 1735

CCG GCT GCG TGT GGG CGC ATC AGT TAC AAT GAC ATG TTT GAG ATG CTG 5402
Pro Ala Ala Cys Gly Arg Ile Ser Tyr Asn Asp Met Phe Glu Met Leu
1740 1745 1750

AAA CAC ATG TCC CCG CCT CTG GGG CTG GGG AAG AAA TGC CCT GCT CGA 5450
Lys His Met Ser Pro Pro Leu Gly Leu Gly Lys Lys Cys Pro Ala Arg
1755 1760 1765

GTT GCT TAC AAG CGC CTG GTT CGC ATG AAC ATG CCC ATC TCC AAC GAG 5498
Val Ala Tyr Lys Arg Leu Val Arg Met Asn Met Pro Ile Ser Asn Glu
1770 1775 1780 1785

GAC ATG ACT GTT CAC TTC ACG TCC ACG CTG ATG GCC CTC ATC CGG ACG 5546
Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala Leu Ile Arg Thr
1790 1795 1800

GCA CTG GAG ATC AAG CTG GCC CCA GCT GGG ACA AAG CAG CAT CAG TGT 5594
Ala Leu Glu Ile Lys Leu Ala Pro Ala Gly Thr Lys Gln His Gln Cys
1805 1810 1815

GAC GCG GAG TTG AGG AAG GAG ATT TCC GTT GTG TGG GCC AAT CTG CCC 5642
Asp Ala Glu Leu Arg Lys Glu Ile Ser Val Val Trp Ala Asn Leu Pro
1820 1825 1830

CAG AAG ACT TTG GAC TTG CTG GTA CCA CCC CAT AAG CCT GAT GAG ATG 5690
Gln Lys Thr Leu Asp Leu Leu Val Pro Pro His Lys Pro Asp Glu Met
1835 1840 1845

ACA GTG GGG AAG GTT TAT GCA GCT CTG ATG ATA TTT GAC TTC TAC AAG 5738
Thr Val Gly Lys Val Tyr Ala Ala Leu Met Ile Phe Asp Phe Tyr Lys
1850 1855 1860 1865

CAG AAC AAA ACC ACC AGA GAC CAG ATG CAG CAG GCT CCT GGA GGC CTC 5786
Gln Asn Lys Thr Thr Arg Asp Gln Met Gln Gln Ala Pro Gly Gly Leu
1870 1875 1880 

TCC CAG ATG GGT CCT GTG TCC CTG TTC CAC CCT CTG AAG GCC ACC CTG 5834 
Ser Gln Met Gly Pro Val Ser Leu Phe His Pro Leu Lys Ala Thr Leu
1885 1890 1895 

GAG CAG ACA CAG CCG GCT GTG CTC CGA GGA GCC CGG GTT TTC CTT CGA 5882 
Glu Gln Thr Gln Pro Ala Val Leu Arg Gly Ala Arg Val Phe Leu Arg
1900 1905 1910 

CAG AAG AGT TCC ACC TCC CTC AGC AAT GGC GGG GCC ATA CAA AAC CAA 5930 
Gln Lys Ser Ser Thr Ser Leu Ser Asn Gly Gly Ala Ile Gln Asn Gln
1915 1920 1925 

GAG AGT GGC ATC AAA GAG TCT GTC TCC TGG GGC ACT CAA AGG ACC CAG 5978 
Glu Ser Gly Ile Lys Glu Ser Val Ser Trp Gly Thr Gln Arg Thr Gln
1930 1935 1940 1945 

GAT GCA CCC CAT GAG GCC AGG CCA CCC CTG GAG CGT GGC CAC TCC ACA 6026 
Asp Ala Pro His Glu Ala Arg Pro Pro Leu Glu Arg Gly His Ser Thr 
1950 1955 1960

GAG ATC CCT GTG GGG CGG TCA GGA GCA CTG GCT GTG GAC GTT CAG ATG 6074 
Glu Ile Pro Val Gly Arg Ser Gly Ala Leu Ala Val Asp Val Gln Met
1965 1970 1975 

CAG AGC ATA ACC CGG AGG GGC CCT GAT GGG GAG CCC CAG CCT GGG CTG 6122 
Gln Ser Ile Thr Arg Arg Gly Pro Asp Gly Glu Pro Gln Pro Gly Leu
1980 1985 1990 

GAG AGC CAG GGT CGA GCG GCC TCC ATG CCC CGC CTT GCG GCC GAG ACT 6170 
Glu Ser Gln Gly Arg Ala Ala Ser Met Pro Arg Leu Ala Ala Glu Thr
1995 2000 2005 

CAG CCC GTC ACA GAT GCC AGC CCC ATG AAG CGC TCC ATC TCC ACG CTG 6218 
Gln Pro Val Thr Asp Ala Ser Pro Met Lys Arg Ser Ile Ser Thr Leu
2010 2015 2020 2025 

GCC CAG CGG CCC CGT GGG ACT CAT CTT TGC AGC ACC ACC CCG GAC CGC 6266 
Ala Gln Arg Pro Arg Gly Thr His Leu Cys Ser Thr Thr Pro Asp Arg
2030 2035 2040 

CCA CCC CCT AGC CAG GCG TCG TCG CAC CAC CAC CAC CAC CGC TGC CAC 6314 
Pro Pro Pro Ser Gln Ala Ser Ser His His His His His Arg Cys His
2045 2050 2055 

CGC CGC AGG GAC AGG AAG CAG AGG TCC CTG GAG AAG GGG CCC AGC CTG 6362 
Arg Arg Arg Asp Arg Lys Gln Arg Ser Leu Glu Lys Gly Pro Ser Leu
2060 2065 2070 

TCT GCC GAT ATG GAT GGC GCA CCA AGC AGT GCT GTG GGG CCG GGG CTG 6410 
Ser Ala Asp Met Asp Gly Ala Pro Ser Ser Ala Val Gly Pro Gly Leu 
2075 2080 2085

ss sj a 2 w ss 2 2 22 - G ^ 2 2

SS 2 22 2 22 - - - « « s s g« 

is ss 2 a - « 2 2 2 2 2 s 2 2 22 



6458 

2105 

6506 



2130 2135 



s s 22 2 s s ss 2 2 a s s s 2 s 

2145 2150 

S £ SS 2 SS 2 gg S 2 22 2 2 2 



2165 



2245 



2?° I CT CGA ATT GGC TCT GAC C CT TAC CTG GGG CAG CGT CTG GAC 
JroQly ser Arg He Gly Ser Asp Pro Tyr Leu Gly £g Sp 

5 2260 2265 

SS S sS Si SS J£ P G ^ T 2?° GAC ACG CTC ACT ™ «0 

Ser I** Ala Leu Pro Olu Asp Thr Leu Thr Phe Glu 

2270 2275 2280 

SS J2 Si S JSr jSS S 2?° CGC TCC TCC AGG ACT TCC ™ GTG 
Val „?_ Itar Asn Ser G1 y ^9 Ser Ser Arg Thr Ser Tyr Val 
2285 2290 2295 

TCC TCC CTG ACC TCC CAG TCT CAC CCT CTC CGC CGC GTG ccc aar 
Ser ser Leu Thr Ser Gin Ser His Pro LeS 2g Sg Si £o £S g?J 
2300 2305 2310 



6554 



6602 



6650 



ays ss a 2 ss 5 s s s s 2 2 25 2 

2175 2180 2185 

2 S- |g SS S SS 2 S SS SS SS 2 2 

190 2195 2200 

2 ss 2 22 2 s s 22 2 2 SS IS SI SS 

2 2 2 2 SS 2 2 |«S 2 2 2 22 2 

2225 2230 

as as 2 ss 2 2 ss 2 2 2 2 2 ss 2 2 2 «« 

224 0 



6698 



6746 



6794 



6842 



6938 



6986 



7034 



7082 
TAC CAC TGC ACC CTG GGA CTC AGC TCG GGT GGC CGA GCA CGG CAC AGC 7130
Tyr His Cys Thr Leu Gly Leu Ser Ser Gly Gly Arg Ala Arg His Ser 
2315 2320 2325 

TAC CAC CAC CCT GAC CAA GAC CAC TGG TGC TAGCTGCACC GTGACCGCTC 7180 
Tyr His His Pro Asp Gln Asp His Trp Cys
2330 2335 234 

AGACGCCTGC ATGCAGCAGG CGTGTGTTCC AGTGGATGAG TTTTATCATC CACACGGGGC 7240 
AGTCGGCCCT CGGGGGAGGC CTTGCCCACC TTGGTGAGGC TCCTGTGGCC CCTCCCTCCC 7300 
CCTCCTCCCC TCTTTTACTC TAGACGACGA ATAAAGCCCT GTTGCTTGAG TGTACGTACC 7360 
GC 7362 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 144..6857

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..143 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR

(B) LOCATION: 6855..7175

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCGGCGGCGG CTGCGGCGGT GGGGCCGGGC GAGGTCCGTG CGGTCCCGGC GGCTCCGTGG 60 

CTGCTCCGCT CTGAGCGCCT GCGCGCCCCG CGCCCTCCCT GCCGGGGCCG CTGGGCCGGG 120

GATGCACGCG GGGCCCGGGA GCC ATG GTC CGC TTC GGG GAC GAG CTG GGC 170 

Met Val Arg Phe Gly Asp Glu Leu Gly 
1 5 

GGC CGC TAT GGA GGC CCC GGC GGC GGA GAG CGG GCC CGG GGC GGC GGG 218 
Gly Arg Tyr Gly Gly Pro Gly Gly Gly Glu Arg Ala Arg Gly Gly Gly 
10 15 20 25

GCC GGC GGG GCG GGG GGC CCG GGT CCC GGG GGG CTG CAG CCC GGC CAG 266 
Ala Gly Gly Ala Gly Gly Pro Gly Pro Gly Gly Leu Gln Pro Gly Gln
30 35 40

CGG GTC CTC TAC AAG CAA TCG ATC GCG CAG CGC GCG CGG ACC ATG GCG 314
Arg Val Leu Tyr Lys Gln Ser Ile Ala Gln Arg Ala Arg Thr Met Ala
45 50 55 

CTG TAC AAC CCC ATC CCG GTC AAG CAG AAC TGC TTC ACC GTC AAC CGC 362 
Leu Tyr Asn Pro Ile Pro Val Lys Gln Asn Cys Phe Thr Val Asn Arg
60 65 70 

TCG CTC TTC GTC TTC AGC GAG GAC AAC GTC GTC CGC AAA TAC GCG AAG 410 
Ser Leu Phe Val Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys 
75 80 85 

CGC ATC ACC GAG TGG CCT CCA TTC GAG AAT ATG ATC CTG GCC ACC ATC 458
Arg Ile Thr Glu Trp Pro Pro Phe Glu Asn Met Ile Leu Ala Thr Ile
90 95 100 105 

ATC GCC AAC TGC ATC GTG CTG GCC CTG GAG CAG CAC CTC CCT GAT GGG 506 
Ile Ala Asn Cys Ile Val Leu Ala Leu Glu Gln His Leu Pro Asp Gly
110 115 120

GAC AAA ACG CCC ATG TCC GAG CGG CTG GAC GAC ACG GAG CCC TAT TTC 554 
Asp Lys Thr Pro Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe 
125 130 135

ATC GGG ATC TTT TGC TTC GAG GCA GGG ATC AAA ATC ATC GCT CTG GGC 602 
Ile Gly Ile Phe Cys Phe Glu Ala Gly Ile Lys Ile Ile Ala Leu Gly
140 145 150 

TTT GTC TTC CAC AAG GGC TCT TAC CTG CGG AAC GGC TGG AAC GTC ATG 650 
Phe Val Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met 
155 160 165 

GAC TTC GTG GTC GTC CTC ACA GGG ATC CTT GCC ACG GCT GGA ACT GAC 698 
Asp Phe Val Val Val Leu Thr Gly Ile Leu Ala Thr Ala Gly Thr Asp
170 175 180 185

TTC GAC CTG CGA ACA CTG AGG GCT GTG CGT GTG CTG AGG CCC CTG AAG 746 
Phe Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys 
190 195 200 

CTG GTG TCT GGG ATT CCA AGT TTG CAG GTG GTG CTC AAG TCC ATC ATG 794 
Leu Val Ser Gly Ile Pro Ser Leu Gln Val Val Leu Lys Ser Ile Met
205 210 215 

AAG GCC ATG GTT CCA CTC CTG CAG ATT GGG CTG CTT CTC TTC TTT GCC 842 
Lys Ala Met Val Pro Leu Leu Gln Ile Gly Leu Leu Leu Phe Phe Ala
220 225 230 

ATC CTC ATG TTT GCC ATC ATT GGC CTG GAG TTC TAC ATG GGC AAG TTC 890 
Ile Leu Met Phe Ala Ile Ile Gly Leu Glu Phe Tyr Met Gly Lys Phe
235 240 245 

CAC AAG GCC TGT TTC CCC AAC AGC ACA GAT GCG GAG CCC GTG GGT GAC 938 
His Lys Ala Cys Phe Pro Asn Ser Thr Asp Ala Glu Pro Val Gly Asp 
250 255 260 265

TTC CCC TGT GGC AAG GAG GCC CCA GCC CGG CTG TGC GAG GGC GAC ACT 986
Phe Pro Cys Gly Lys Glu Ala Pro Ala Arg Leu Cys Glu Gly Asp Thr 
270 275 280 

GAG TGC CGG GAG TAC TGG CCA GGA CCC AAC TTT GGC ATC ACC AAC TTT 1034 
Glu Cys Arg Glu Tyr Trp Pro Gly Pro Asn Phe Gly Ile Thr Asn Phe
285 290 295 

GAC AAT ATC CTG TTT GCC ATC TTG ACG GTG TTC CAG TGC ATC ACC ATG 1082 
Asp Asn Ile Leu Phe Ala Ile Leu Thr Val Phe Gln Cys Ile Thr Met
300 305 310 

GAG GGC TGG ACT GAC ATC CTC TAT AAT ACA AAC GAT GCG GCC GGC AAC 1130 
Glu Gly Trp Thr Asp Ile Leu Tyr Asn Thr Asn Asp Ala Ala Gly Asn
315 320 325

ACC TGG AAC TGG CTC TAC TTC ATC CCT CTC ATC ATC ATC GGC TCC TTC 1178 
Thr Trp Asn Trp Leu Tyr Phe Ile Pro Leu Ile Ile Ile Gly Ser Phe
330 335 340 345

TTC ATG CTC AAC CTG GTG CTG GGC GTG CTC TCG GGG GAG TTT GCC AAG 1226 
Phe Met Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys 
350 355 360 

GAG CGA GAG AGG GTG GAG AAC CGC CGC GCC TTC CTG AAG CTG CGC CGG 1274 
Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg 
365 370 375 

CAG CAG CAG ATC GAG CGA GAG CTC AAC GGG TAC CTG GAG TGG ATC TTC 1322 
Gln Gln Gln Ile Glu Arg Glu Leu Asn Gly Tyr Leu Glu Trp Ile Phe
380 385 390 

AAG GCG GAG GAA GTC ATG CTG GCC GAG GAG GAC AGG AAT GCA GAG GAG 1370 
Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asp Arg Asn Ala Glu Glu 
395 400 405 

AAG TCC CCT TTG GAC GTG CTG AAG AGA GCG GCC ACC AAG AAG AGC AGA 1418 
Lys Ser Pro Leu Asp Val Leu Lys Arg Ala Ala Thr Lys Lys Ser Arg 
410 415 420 425 

AAT GAC CTG ATC CAC GCA GAG GAG GGA GAG GAC CGG TTT GCA GAT CTC 1466 
Asn Asp Leu Ile His Ala Glu Glu Gly Glu Asp Arg Phe Ala Asp Leu
430 435 440 

TGT GCT GTT GGA TCC CCC TTC GCC CGC GCC AGC CTC AAG AGC GGG AAG 1514 
Cys Ala Val Gly Ser Pro Phe Ala Arg Ala Ser Leu Lys Ser Gly Lys 
445 450 455 

ACA GAG AGC TCG TCA TAC TTC CGG AGG AAG GAG AAG ATG TTC CGG TTT 1562 
Thr Glu Ser Ser Ser Tyr Phe Arg Arg Lys Glu Lys Met Phe Arg Phe 
460 465 470

TTT ATC CGG CGC ATG GTG AAG GCT CAG AGC TTC TAC TGG GTG GTG CTG 1610 
Phe Ile Arg Arg Met Val Lys Ala Gln Ser Phe Tyr Trp Val Val Leu
475 480 485

TGC GTG GTG GCC CTG AAC ACA CTG TGT GTG GCC ATG GTG CAT TAC AAC 1658
Cys Val Val Ala Leu Asn Thr Leu Cys Val Ala Met Val His Tyr Asn
490 495 500 505

CAG CCG CGG CGG CTT ACC ACG ACC CTG TAT TTT GCA GAG TTT GTT TTC 1706
Gln Pro Arg Arg Leu Thr Thr Thr Leu Tyr Phe Ala Glu Phe Val Phe
510 515 520

CTG GGT CTC TTC CTC ACA GAG ATG TCC CTG AAG ATG TAT GGC CTG GGG 1754
Leu Gly Leu Phe Leu Thr Glu Met Ser Leu Lys Met Tyr Gly Leu Gly
525 530 535

CCC AGA AGC TAC TTC CGG TCC TCC TTC AAC TGC TTC GAC TTT GGG GTC 1802
Pro Arg Ser Tyr Phe Arg Ser Ser Phe Asn Cys Phe Asp Phe Gly Val
540 545 550

ATC GTG GGG AGC GTC TTT GAA GTG GTC TGG GCG GCC ATC AAG CCG GGA 1850
Ile Val Gly Ser Val Phe Glu Val Val Trp Ala Ala Ile Lys Pro Gly
555 560 565

AGC TCC TTT GGG ATC AGT GTG CTG CGG GCC CTC CGC CTG CTG AGG ATC 1898
Ser Ser Phe Gly Ile Ser Val Leu Arg Ala Leu Arg Leu Leu Arg Ile
570 575 580 585

TTC AAA GTC ACG AAG TAC TGG AGC TCC CTG CGG AAC CTG GTG GTG TCC 1946
Phe Lys Val Thr Lys Tyr Trp Ser Ser Leu Arg Asn Leu Val Val Ser
590 595 600

CTG CTG AAC TCC ATG AAG TCC ATC ATC AGC CTG CTC TTC TTG CTC TTC 1994
Leu Leu Asn Ser Met Lys Ser Ile Ile Ser Leu Leu Phe Leu Leu Phe
605 610 615

CTG TTC ATT GTG GTC TTC GCC CTG CTG GGG ATG CAG CTG TTT GGG GGA 2042
Leu Phe Ile Val Val Phe Ala Leu Leu Gly Met Gln Leu Phe Gly Gly
620 625 630

CAG TTC AAC TTC CAG GAT GAG ACT CCC ACA ACC AAC TTC GAC ACC TTC 2090
Gln Phe Asn Phe Gln Asp Glu Thr Pro Thr Thr Asn Phe Asp Thr Phe
635 640 645

CCT GCC GCC ATC CTC ACT GTC TTC CAG ATC CTG ACG GGA GAG GAC TGG 2138
Pro Ala Ala Ile Leu Thr Val Phe Gln Ile Leu Thr Gly Glu Asp Trp
650 655 660 665

AAT GCA GTG ATG TAT CAC GGG ATC GAA TCG CAA GGC GGC GTC AGC AAA 2186
Asn Ala Val Met Tyr His Gly Ile Glu Ser Gln Gly Gly Val Ser Lys
670 675 680

GGC ATG TTC TCG TCC TTT TAC TTC ATT GTC CTG ACA CTG TTC GGA AAC 2234
Gly Met Phe Ser Ser Phe Tyr Phe Ile Val Leu Thr Leu Phe Gly Asn
685 690 695

TAC ACT CTG CTG AAT GTC TTT CTG GCC ATC GCT GTG GAC AAC CTG GCC 2282
Tyr Thr Leu Leu Asn Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala
700 705 710

AAC GCC CAA GAG CTG ACC AAG GAT GAA GAG GAG ATG GAA GAA GCA GCC 2330
Asn Ala Gln Glu Leu Thr Lys Asp Glu Glu Glu Met Glu Glu Ala Ala
715 720 725 

AAT CAG AAG CTT GCT CTG CAA AAG GCC AAA GAA GTG GCT GAA GTC AGC 2378 
Asn Gln Lys Leu Ala Leu Gln Lys Ala Lys Glu Val Ala Glu Val Ser
730 735 740 745 

CCC ATG TCT GCC GCG AAC ATC TCC ATC GCC GCC AGG CAG CAG AAC TCG 2426 
Pro Met Ser Ala Ala Asn Ile Ser Ile Ala Ala Arg Gln Gln Asn Ser
750 755 760 

GCC AAG GCG CGC TCG GTG TGG GAG CAG CGG GCC AGC CAG CTA CGG CTG 2474 
Ala Lys Ala Arg Ser Val Trp Glu Gln Arg Ala Ser Gln Leu Arg Leu
765 770 775 

CAG AAC CTG CGG GCC AGC TGC GAG GCG CTG TAC AGC GAG ATG GAC CCC 2522 
Gln Asn Leu Arg Ala Ser Cys Glu Ala Leu Tyr Ser Glu Met Asp Pro
780 785 790 

GAG GAG CGG CTG CGC TTC GCC ACT ACG CGC CAC CTG CGG CCC GAC ATG 2570 
Glu Glu Arg Leu Arg Phe Ala Thr Thr Arg His Leu Arg Pro Asp Met 
795 800 805

AAG ACG CAC CTG GAC CGG CCG CTG GTG GTG GAG CTG GGC CGC GAC GGC 2618 
Lys Thr His Leu Asp Arg Pro Leu Val Val Glu Leu Gly Arg Asp Gly 
810 815 820 825 

GCG CGG GGG CCC GTG GGA GGC AAA GCC CGA CCT GAG GCT GCG GAG GCC 2666 
Ala Arg Gly Pro Val Gly Gly Lys Ala Arg Pro Glu Ala Ala Glu Ala 
830 835 840 

CCC GAG GGC GTC GAC CCT CCG CGC AGG CAC CAC CGG CAC CGC GAC AAG 2714 
Pro Glu Gly Val Asp Pro Pro Arg Arg His His Arg His Arg Asp Lys 
845 850 855 

GAC AAG ACC CCC GCG GCG GGG GAC CAG GAC CGA GCA GAG GCC CCG AAG 2762 
Asp Lys Thr Pro Ala Ala Gly Asp Gln Asp Arg Ala Glu Ala Pro Lys
860 865 870 

GCG GAG AGC GGG GAG CCC GGT GCC CGG GAG GAG CGG CCG CGG CCG CAC 2810 
Ala Glu Ser Gly Glu Pro Gly Ala Arg Glu Glu Arg Pro Arg Pro His 
875 880 885 

CGC AGC CAC AGC AAG GAG GCC GCG GGG CCC CCG GAG GCG CGG AGC GAG 2858 
Arg Ser His Ser Lys Glu Ala Ala Gly Pro Pro Glu Ala Arg Ser Glu 
890 895 900 905 

CGC GGC CGA GGC CCA GGC CCC GAG GGC GGC CGG CGG CAC CAC CGG CGC 2906 
Arg Gly Arg Gly Pro Gly Pro Glu Gly Gly Arg Arg His His Arg Arg 
910 915 920 

GGC TCC CCG GAG GAG GCG GCC GAG CGG GAG CCC CGA CGC CAC CGC GCG 2954 
Gly Ser Pro Glu Glu Ala Ala Glu Arg Glu Pro Arg Arg His Arg Ala 
925 930 935

CAC CGG CAC CAG GAT CCG AGC AAG GAG TGC GCC GGC GCC AAG GGC GAG 3002
His Arg His Gln Asp Pro Ser Lys Glu Cys Ala Gly Ala Lys Gly Glu
940 945 950 

CGG CGC GCG CGG CAC CGC GGC GGC CCC CGA GCG GGG CCC CGG GAG GCG 3050 
Arg Arg Ala Arg His Arg Gly Gly Pro Arg Ala Gly Pro Arg Glu Ala 
955 960 965 

GAG AGC GGG GAG GAG CCG GCG CGG CGG CAC CGG GCC CGG CAC AAG GCG 3098 
Glu Ser Gly Glu Glu Pro Ala Arg Arg His Arg Ala Arg His Lys Ala 
970 975 980 985 

CAG CCT GCT CAC GAG GCT GTG GAG AAG GAG ACC ACG GAG AAG GAG GCC 3146 
Gln Pro Ala His Glu Ala Val Glu Lys Glu Thr Thr Glu Lys Glu Ala
990 995 1000 

ACG GAG AAG GAG GCT GAG ATA GTG GAA GCC GAC AAG GAA AAG GAG CTC 3194 
Thr Glu Lys Glu Ala Glu Ile Val Glu Ala Asp Lys Glu Lys Glu Leu
1005 1010 1015 

CGG AAC CAC CAG CCC CGG GAG CCA CAC TGT GAC CTG GAG ACC AGT GGG 3242 
Arg Asn His Gln Pro Arg Glu Pro His Cys Asp Leu Glu Thr Ser Gly
1020 1025 1030 

ACT GTG ACT GTG GGT CCC ATG CAC ACA CTG CCC AGC ACC TGT CTC CAG 3290 
Thr Val Thr Val Gly Pro Met His Thr Leu Pro Ser Thr Cys Leu Gln
1035 1040 1045 

AAG GTG GAG GAA CAG CCA GAG GAT GCA GAC AAT CAG CGG AAC GTC ACT 3338 
Lys Val Glu Glu Gln Pro Glu Asp Ala Asp Asn Gln Arg Asn Val Thr
1050 1055 1060 1065 

CGC ATG GGC AGT CAG CCC CCA GAC CCG AAC ACT ATT GTA CAT ATC CCA 3386 
Arg Met Gly Ser Gln Pro Pro Asp Pro Asn Thr Ile Val His Ile Pro
1070 1075 1080 

GTG ATG CTG ACG GGC CCT CTT GGG GAA GCC ACG GTC GTT CCC AGT GGT 3434 
Val Met Leu Thr Gly Pro Leu Gly Glu Ala Thr Val Val Pro Ser Gly 
1085 1090 1095 

AAC GTG GAC CTG GAA AGC CAA GCA GAG GGG AAG AAG GAG GTG GAA GCG 3482 
Asn Val Asp Leu Glu Ser Gln Ala Glu Gly Lys Lys Glu Val Glu Ala
1100 1105 1110

GAT GAC GTG ATG AGG AGC GGC CCC CGG CCT ATC GTC CCA TAC AGC TCC 3530 
Asp Asp Val Met Arg Ser Gly Pro Arg Pro Ile Val Pro Tyr Ser Ser
1115 1120 1125

ATG TTC TGT TTA AGC CCC ACC AAC CTG CTC CGC CGC TTC TGC CAC TAC 3578 
Met Phe Cys Leu Ser Pro Thr Asn Leu Leu Arg Arg Phe Cys His Tyr 
1130 1135 1140 1145

ATC GTG ACC ATG AGG TAC TTC GAG GTG GTC ATT CTC GTG GTC ATC GCC 3626 
Ile Val Thr Met Arg Tyr Phe Glu Val Val Ile Leu Val Val Ile Ala
1150 1155 1160

TTG AGC AGC ATC GCC CTG GCT GCT GAG GAC CCA GTG CGC ACA GAC TCG 3674
Leu Ser Ser Ile Ala Leu Ala Ala Glu Asp Pro Val Arg Thr Asp Ser
1165 1170 1175 

CCC AGG AAC AAC GCT CTG AAA TAC CTG GAT TAC ATT TTC ACT GGT GTC 3722
Pro Arg Asn Asn Ala Leu Lys Tyr Leu Asp Tyr Ile Phe Thr Gly Val
1180 1185 1190

TTT ACC TTT GAG ATG GTG ATA AAG ATG ATC GAC TTG GGA CTG CTG CTT 3770 
Phe Thr Phe Glu Met Val Ile Lys Met Ile Asp Leu Gly Leu Leu Leu
1195 1200 1205 

CAC CCT GGA GCC TAT TTC CGG GAC TTG TGG AAC ATT CTG GAC TTC ATT 3818 
His Pro Gly Ala Tyr Phe Arg Asp Leu Trp Asn Ile Leu Asp Phe Ile
1210 1215 1220 1225 

GTG GTC AGT GGC GCC CTG GTG GCG TTT GCT TTC TCA GGA TCC AAA GGG 3866
Val Val Ser Gly Ala Leu Val Ala Phe Ala Phe Ser Gly Ser Lys Gly 
1230 1235 1240 

AAA GAC ATC AAT ACC ATC AAG TCT CTG AGA GTC CTT CGT GTC CTG CGG 3914 
Lys Asp Ile Asn Thr Ile Lys Ser Leu Arg Val Leu Arg Val Leu Arg
1245 1250 1255 

CCC CTC AAG ACC ATC AAA CGG CTG CCC AAG CTC AAG GCT GTG TTT GAC 3962
Pro Leu Lys Thr Ile Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp
1260 1265 1270 

TGT GTG GTG AAC TCC CTG AAG AAT GTC CTC AAC ATC TTG ATT GTC TAC 4010
Cys Val Val Asn Ser Leu Lys Asn Val Leu Asn Ile Leu Ile Val Tyr
1275 1280 1285 

ATG CTC TTC ATG TTC ATA TTT GCC GTC ATT GCG GTG CAG CTC TTC AAA 4058
Met Leu Phe Met Phe Ile Phe Ala Val Ile Ala Val Gln Leu Phe Lys
1290 1295 1300 1305 

GGG AAG TTT TTC TAC TGC ACA GAT GAA TCC AAG GAG CTG GAG AGG GAC 4106 
Gly Lys Phe Phe Tyr Cys Thr Asp Glu Ser Lys Glu Leu Glu Arg Asp 
1310 1315 1320 

TGC AGG GGT CAG TAT TTG GAT TAT GAG AAG GAG GAA GTG GAA GCT CAG 4154 
Cys Arg Gly Gln Tyr Leu Asp Tyr Glu Lys Glu Glu Val Glu Ala Gln
1325 1330 1335 

CCC AGG CAG TGG AAG AAA TAC GAC TTT CAC TAC GAC AAT GTG CTC TGG 4202
Pro Arg Gln Trp Lys Lys Tyr Asp Phe His Tyr Asp Asn Val Leu Trp
1340 1345 1350 

GCT CTG CTG ACG CTG TTC ACA GTG TCC ACG GGA GAA GGC TGG CCC ATG 4250 
Ala Leu Leu Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Met 
1355 1360 1365 

GTG CTG AAA CAC TCC GTG GAT GCC ACC TAT GAG GAG CAG GGT CCA AGC 4298 
Val Leu Lys His Ser Val Asp Ala Thr Tyr Glu Glu Gln Gly Pro Ser
1370 1375 1380 1385

CCT GGG TAC CGC ATG GAG CTG TCC ATC TTC TAC GTG GTC TAC TTT GTG 4346
Pro Gly Tyr Arg Met Glu Leu Ser Ile Phe Tyr Val Val Tyr Phe Val
1390 1395 1400 

GTC TTT CCC TTC TTC TTC GTC AAC ATC TTT GTG GCT TTG ATC ATC ATC 4394
Val Phe Pro Phe Phe Phe Val Asn Ile Phe Val Ala Leu Ile Ile Ile
1405 1410 1415 

ACC TTC CAG GAG CAG GGG GAC AAG GTG ATG TCT GAA TGC AGC CTG GAG 4442 
Thr Phe Gln Glu Gln Gly Asp Lys Val Met Ser Glu Cys Ser Leu Glu
1420 1425 1430 

AAG AAC GAG AGG GCT TGC ATT GAC TTC GCC ATC AGC GCC AAA CCC CTG 4490 
Lys Asn Glu Arg Ala Cys Ile Asp Phe Ala Ile Ser Ala Lys Pro Leu
1435 1440 1445 

ACA CGG TAC ATG CCC CAA AAC CGG CAG TCG TTC CAG TAT AAG ACG TGG 4538
Thr Arg Tyr Met Pro Gln Asn Arg Gln Ser Phe Gln Tyr Lys Thr Trp
1450 1455 1460 1465 

ACA TTT GTG GTC TCC CCG CCC TTT GAA TAC TTC ATC ATG GCC ATG ATA 4586 
Thr Phe Val Val Ser Pro Pro Phe Glu Tyr Phe Ile Met Ala Met Ile
1470 1475 1480 

GCC CTC AAC ACT GTG GTG CTG ATG ATG AAG TTC TAT GAT GCA CCC TAT 4634 
Ala Leu Asn Thr Val Val Leu Met Met Lys Phe Tyr Asp Ala Pro Tyr 
1485 1490 1495 

GAG TAC GAG CTG ATG CTG AAA TGC CTG AAC ATC GTG TTC ACA TCC ATG 4682 
Glu Tyr Glu Leu Met Leu Lys Cys Leu Asn Ile Val Phe Thr Ser Met
1500 1505 1510 

TTC TCC ATG GAA TGC GTG CTG AAG ATC ATC GCC TTT GGG GTG CTG AAC 4730
Phe Ser Met Glu Cys Val Leu Lys Ile Ile Ala Phe Gly Val Leu Asn
1515 1520 1525 

TAT TTC AGA GAT GCC TGG AAT GTC TTT GAC TTT GTC ACT GTG TTG GGA 4778 
Tyr Phe Arg Asp Ala Trp Asn Val Phe Asp Phe Val Thr Val Leu Gly 
1530 1535 1540 1545 

AGT ATT ACT GAT ATT TTA GTA ACA GAG ATT GCG GAA ACG AAC AAT TTC 4826
Ser Ile Thr Asp Ile Leu Val Thr Glu Ile Ala Glu Thr Asn Asn Phe
1550 1555 1560 

ATC AAC CTC AGC TTC CTC CGC CTC TTT CGA GCT GCG CGG CTG ATC AAG 4874 
Ile Asn Leu Ser Phe Leu Arg Leu Phe Arg Ala Ala Arg Leu Ile Lys
1565 1570 1575 

CTG CTC CGC CAG GGC TAC ACC ATC CGC ATC CTG CTG TGG ACC TTT GTC 4922 
Leu Leu Arg Gln Gly Tyr Thr Ile Arg Ile Leu Leu Trp Thr Phe Val
1580 1585 1590 

CAG TCC TTC AAG GCC CTG CCC TAC GTG TGT CTG CTC ATT GCC ATG CTG 4970 
Gin Ser Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu He Ala Met Leu 
1595 1600 1605 
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TTC TTC ATC TAC GCC ATC ATC GGC ATG CAG GTG TTT GGG AAT ATT GCC 50X8 
Phe Phe He Tyr Ala He He Gly Met Gin Val Phe Gly Asn He Ala 
1610 1615 1620 1625 

CTG GAT GAT GAC ACC AGC ATC AAC CGC CAC AAC AAC TTC CGG ACG TTT . 5066 

Leu Asp Asp Asp Thr Ser He Asn Arg His Asn Asn Phe Arg Thr Phe 
1630 1635 1640 

TTG CAA GCC CTG ATG CTG CTG TTC AGG AGC GCC ACG GGG GAG GCC TGG 5114 
Leu Gin Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp 
1645 1650 1655 

CAC GAG ATC ATG CTG TCC TGC CTG AGC AAC CAG GCC TGT GAT GAG CAG 5162 
His Glu He Met Leu Ser Cys Leu Ser Asn Gin Ala Cys Asp Glu Gin 
1660 1665 1670 

GCC AAT GCC ACC GAG TGT GGA AGT GAC TTT GCC TAC TTC TAC TTC GTC 5210 
Ala Asn Ala Thr Glu Cys Gly Ser Asp Phe Ala Tyr Phe Tyr Phe Val 
1675 1680 1685 

TCC TTC ATC TTC CTG TGC TCC TTT CTG ATG TTG AAC CTC TTT GTG GCT 5258 
Ser Phe He Phe Leu Cys Ser Phe Leu Met Leu Asn Leu Phe Val Ala 
1690 1695 1700 1705 

GTG ATC ATG GAC AAT TTT GAG TAC CTC ACG CGG GAC TCT TCC ATC CTA 5306 
Val He Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser lie Leu 
1710 1715 1720 

GGT CCT CAC CAC TTG GAT GAG TTC ATC CGG GTC TGG GCT GAA TAC GAC 5354 
G?y Pro Ss His Leu Asp Glu Phe He Arg Val Trp Ala Glu Tyr Asp 
1725 1730 1735 

CCG GCT GCG TGT GGG CGC ATC AGT TAC AAT GAC ATG TTT GAG ATG CTG 5402 
Pro aS Ala Cys Gly Arg He Ser Tyr Asn Asp Met Phe Glu Met Leu 
1740 1745 1750 

AAA CAC ATG TCC CCG CCT CTG GGG CTG GGG AAG AAA TGC CCT GCT CGA 5450
Lys His Met Ser Pro Pro Leu Gly Leu Gly Lys Lys Cys Pro Ala Arg
1755 1760 1765

GTT GCT TAC AAG CGC CTG GTT CGC ATG AAC ATG CCC ATC TCC AAC GAG 5498
Val Ala Tyr Lys Arg Leu Val Arg Met Asn Met Pro Ile Ser Asn Glu
1770 1775 1780 1785

GAC ATG ACT GTT CAC TTC ACG TCC ACG CTG ATG GCC CTC ATC CGG ACG 5546
Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala Leu Ile Arg Thr
1790 1795 1800

[codon and residue rows illegible in source] 5594
1805 1810 1815

[codon and residue rows illegible in source] 5642
1820 1825 1830

[illegible] CTG GTA CCA CCC AAG CCT GAT GAG ATG 5690
Gln Lys Thr Leu Asp Leu Leu Val Pro Pro His Lys Pro Asp Glu Met
1835 1840 1845

ACA GTG GGG AAG GTT TAT GCA GCT CTG ATG ATA TTT GAC TTC TAC [illegible] 5738
Thr Val Gly Lys Val Tyr Ala Ala Leu Met Ile Phe Asp Phe Tyr Lys
1850 1855 1860 1865

CAG AAC AAA ACC ACC AGA GAC CAG ATG CAG CAG GCT CCT GGA GGC [illegible] 5786
Gln Asn Lys Thr Thr Arg Asp Gln Met Gln Gln Ala Pro Gly Gly [illegible]
1870 1875 1880

[illegible] TCC CTG [illegible] CCT CTG [illegible] GCC ACC CTG 5834
Ser Gln Met Gly Pro Val Ser Leu Phe His Pro Leu Lys Ala Thr Leu
1885 1890 1895

[illegible] CCG GCT GTG CTC CGA GGA GCC CGG GTT TTC CTT CGA 5882
Glu Gln Thr Gln Pro Ala Val Leu Arg Gly Ala Arg Val Phe Leu Arg
1900 1905 1910

CAG AAG AGT TCC ACC TCC CTC AGC AAT GGC GGG GCC ATA CAA AAC [illegible] 5930
Gln Lys Ser Ser Thr Ser Leu Ser Asn Gly Gly Ala Ile Gln Asn [illegible]
1915 1920 1925

GAG AGT GGC ATC AAA GAG TCT GTC TCC TGG GGC ACT CAA AGG ACC [illegible] 5978
Glu Ser Gly Ile Lys Glu Ser Val Ser Trp Gly Thr Gln Arg Thr [illegible]
1930 1935 1940 1945

[illegible] CAT GAG GCC AGG CCA CCC CTG GAG CGT GGC CAC TCC ACA 6026
Asp Ala Pro His Glu Ala Arg Pro Pro Leu Glu Arg Gly His Ser Thr
1950 1955 1960

GAG ATC CCT GTG GGG CGG TCA GGA GCA CTG GCT GTG GAC GTT CAG ATG 6074
Glu Ile Pro Val Gly Arg Ser Gly Ala Leu Ala Val Asp Val Gln Met
1965 1970 1975

CAG AGC ATA ACC CGG AGG GGC CCT GAT GGG GAG CCC CAG CCT GGG CTG 6122
Gln Ser Ile Thr Arg Arg Gly Pro Asp Gly Glu Pro Gln Pro Gly Leu
1980 1985 1990

GAG AGC CAG GGT CGA GCG GCC TCC ATG CCC CGC CTT GCG GCC GAG ACT 6170
Glu Ser Gln Gly Arg Ala Ala Ser Met Pro Arg Leu Ala Ala Glu Thr
1995 2000 2005

CAG CCC GTC ACA GAT GCC AGC CCC ATG AAG CGC TCC ATC TCC ACG CTG 6218
Gln Pro Val Thr Asp Ala Ser Pro Met Lys Arg Ser Ile Ser Thr Leu
2010 2015 2020 2025

GCC CAG CGG CCC CGT GGG ACT CAT CTT TGC AGC ACC ACC CCG GAC CGC 6266
Ala Gln Arg Pro Arg Gly Thr His Leu Cys Ser Thr Thr Pro Asp Arg
2030 2035 2040

CCA CCC CCT AGC CAG GCG TCG TCG CAC CAC CAC CAC CAC CGC TGC CAC 6314
Pro Pro Pro Ser Gln Ala Ser Ser His His His His His Arg Cys His
2045 2050 2055

CGC CGC AGG GAC AGG AAG CAG AGG TCC CTG GAG AAG GGG CCC AGC CTG 6362
Arg Arg Arg Asp Arg Lys Gln Arg Ser Leu Glu Lys Gly Pro Ser Leu
2060 2065 2070

TCT GCC GAT ATG GAT GGC GCA CCA AGC AGT GCT GTG GGG CCG GGG CTG 6410
Ser Ala Asp Met Asp Gly Ala Pro Ser Ser Ala Val Gly Pro Gly Leu
2075 2080 2085

CCC CCG GGA GAG GGG CCT ACA GGC TGC CGG CGG GAA CGA GAG CGC CGG 6458
Pro Pro Gly Glu Gly Pro Thr Gly Cys Arg Arg Glu Arg Glu Arg Arg
2090 2095 2100 2105

CAG GAG CGG GGC CGG TCC CAG GAG CGG AGG CAG CCC TCA TCC TCC TCC 6506
Gln Glu Arg Gly Arg Ser Gln Glu Arg Arg Gln Pro Ser Ser Ser Ser
2110 2115 2120

TCG GAG AAG CAG CGC TTC TAC TCC TGC GAC CGC TTT GGG GGC CGT GAG 6554
Ser Glu Lys Gln Arg Phe Tyr Ser Cys Asp Arg Phe Gly Gly Arg Glu
2125 2130 2135

CCC CCG AAG CCC AAG CCC TCC CTC AGC AGC CAC CCA ACG TCG CCA ACA 6602
Pro Pro Lys Pro Lys Pro Ser Leu Ser Ser His Pro Thr Ser Pro Thr
2140 2145 2150

GCT GGC CAG GAG CCG GGA CCC CAC CCA CAG GCC GGC TCA GCC GTG GGC 6650
Ala Gly Gln Glu Pro Gly Pro His Pro Gln Ala Gly Ser Ala Val Gly
2155 2160 2165

TTT CCG AAC ACA ACG CCC TGC TGC AGA GAG ACC CCC TCA GCC AGC CCC 6698
Phe Pro Asn Thr Thr Pro Cys Cys Arg Glu Thr Pro Ser Ala Ser Pro
2170 2175 2180 2185

[illegible] CCC CTG GCT CTC GAA TTG GCT CTG ACC CTT ACC TGG GGC AGC GTC 6746
[illegible] Pro Leu Ala Leu Glu Leu Ala Leu Thr Leu Thr Trp Gly Ser Val
2190 2195 2200

TGG ACA GTG AGG CCT CTG TCC ACG CCC TGC CTG AGG ACA CGC TCA CTT 6794
Trp Thr Val Arg Pro Leu Ser Thr Pro Cys Leu Arg Thr Arg Ser Leu
2205 2210 2215

[codon and residue rows illegible in source] 6842
2220 2225 2230

ACG TGT CCT CCC TGACCTCCCA GTCTCACCCT CTCCGCCGCG TGCCCAACGG 6894
Thr Cys Pro Pro
2235

TTACCACTGC ACCCTGGGAC TCAGCTCGGG TGGCCGAGCA CGGCACAGCT ACCACCACCC 6954
TGACCAAGAC CACTGGTGCT AGCTGCACCG TGACCGCTCA GACGCCTGCA TGCAGCAGGC 7014
GTGTGTTCCA GTGGATGAGT TTTATCATCC ACACGGGGCA GTCGGCCCTC GGGGGAGGCC 7074
TTGCCCACCT TGGTGAGGCT CCTGTGGCCC CTCCCTCCCC CTCCTCCCCT CTTTTACTCT 7134
AGACGACGAA TAAAGCCCTG TTGCTTGAGT GTACGTACCG C 7175

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1546 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1437

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 1435..1546

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:



ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 48
Met Val Gln Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gln
1 5 10 15

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 96
Glu Ile Pro Met Glu Val Phe Asp Pro Ser Pro Gln Gly Lys Tyr Ser
20 25 30

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 144
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp
35 40 45

TCC [illegible] AGC TTT GTC CGC GGC TCA GCG GAG TCC TAC ACT 192
Thr Thr Ser Asn Ser Phe Val Arg Gln Gly Ser Ala Glu Ser Tyr Thr
50 55 60

[illegible] TCT CTG GAG GAG GAC [illegible] GAA GCC 240
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala
65 70 75 80



TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 288
Leu Arg Lys Glu Ala Glu Arg Gln Ala Leu Ala Gln Leu Glu Lys Ala
85 90 95

[illegible] AGC GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 336
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn
100 105 110

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384
Pro Ser Pro Gly Asp Glu Val Pro Val Gln Gly Val Ala Ile Thr Phe
115 120 125

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432
Glu Pro Lys Asp Phe Leu His Ile Lys Glu Lys Tyr Asn Asn Asp Trp
130 135 140

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480
Trp Ile Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe Ile Pro
145 150 155 160

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gln Glu Gln Lys Leu
165 170 175

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576
Arg Gln Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser
180 185 190

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala
195 200 205

AGT GCC AAA CAG AAG CAG AAG TCG ACA GAG CAT GTG CCC CCC TAT GAC 672
Ser Ala Lys Gln Lys Gln Lys Ser Thr Glu His Val Pro Pro Tyr Asp
210 215 220

GTG GTG CCT TCC ATG AGG CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG 720
Val Val Pro Ser Met Arg Pro Ile Ile Leu Val Gly Pro Ser Leu Lys
225 230 235 240

GGC TAC GAG GTT ACA GAC ATG ATG CAG AAA GCT TTA TTT GAC TTC TTG 768
Gly Tyr Glu Val Thr Asp Met Met Gln Lys Ala Leu Phe Asp Phe Leu
245 250 255

AAG CAT CGG TTT GAT GGC AGG ATC TCC ATC ACT CGT GTG ACG GCA GAT 816
Lys His Arg Phe Asp Gly Arg Ile Ser Ile Thr Arg Val Thr Ala Asp
260 265 270

ATT TCC CTG GCT AAG CGC TCA GTT CTC AAC AAC CCC AGC AAA CAC ATC 864
Ile Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His Ile
275 280 285

ATC ATT GAG CGC TCC AAC ACA CGC TCC AGC CTG GCT GAG GTG CAG AGT 912
Ile Ile Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gln Ser
290 295 300

[codon and residue rows illegible in source] 960
305 310 315 320

[codon and residue rows illegible in source] 1008
325 330 335

[codon and residue rows illegible in source] 1056
340 345 350




CAA AGG CTC ATC AAG TCC CGA GGA AAG TCT CAG TCC AAA CAC CTC AAT 1104
Gln Arg Leu Ile Lys Ser Arg Gly Lys Ser Gln Ser Lys His Leu Asn
355 360 365

GTC CAA ATA GCG GCC TCG GAA AAG CTG GCA CAG TGC CCC CCT GAA ATG 1152
Val Gln Ile Ala Ala Ser Glu Lys Leu Ala Gln Cys Pro Pro Glu Met
370 375 380

TTT GAC ATC ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT 1200
Phe Asp Ile Ile Leu Asp Glu Asn Gln Leu Glu Asp Ala Cys Glu His
385 390 395 400

CTG GCG GAG TAC TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGC 1248
Leu Ala Glu Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser
405 410 415

AGC ACG CCA CCC AAT CCG CTG CTG AAC CGC ACC ATG GCT ACC GCA GCC 1296
Ser Thr Pro Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala
420 425 430

CTG GCT GCC AGC CCT GCC CCT GTC TCC AAC CTC CAG GTA CAG GTG CTC 1344
Leu Ala Ala Ser Pro Ala Pro Val Ser Asn Leu Gln Val Gln Val Leu
435 440 445

ACC TCG CTC AGG AGA AAC CTC GGC TTC TGG GGC GGG CTG GAG TCC TCA 1392
Thr Ser Leu Arg Arg Asn Leu Gly Phe Trp Gly Gly Leu Glu Ser Ser
450 455 460

CAG CGG GGC AGT GTG GTG CCC CAG GAG CAG GAA CAT GCC ATG TAGTGGGCGC 1444
Gln Arg Gly Ser Val Val Pro Gln Glu Gln Glu His Ala Met
465 470 475

CCTGCCCGTC TTCCCTCCTG CTCTGGGGTC GGAACTGGAG TGCAGGGAAC ATGGAGGAGG 1504
AAGGGAAGAG CTTTATTTTG TAAAAAAATA AGATGAGCGG CA 1546
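
As a consistency check on the feature table for SEQ ID NO: 9, the CDS span can be converted into a residue count. The sketch below assumes that the CDS location is 1-based, inclusive, and counts the stop codon; the listing's own coordinates suggest this, since the 3'UTR feature begins at 1435, inside the CDS span 1..1437. The helper name `cds_residue_count` is hypothetical and is not part of the listing.

```python
def cds_residue_count(start: int, end: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a 1-based, inclusive CDS span.

    Assumption: when includes_stop is True, the last codon of the span is the
    stop codon and does not contribute a residue.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = length // 3
    return codons - 1 if includes_stop else codons


# SEQ ID NO: 9, CDS 1..1437
print(cds_residue_count(1, 1437))  # 478
```

The result, 478, agrees with the residue numbering of the final coding row (465 to 478). The same arithmetic gives 598 residues for the CDS 1..1797 of SEQ ID NO: 10 and 1091 residues for the CDS 35..3310 of SEQ ID NO: 11.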

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1797 

(D) OTHER INFORMATION: /standard_name= "Beta1-3"

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1795..1851

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 48
Met Val Gln Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gln
1 5 10 15

GAG ATC CCC ATG GGA GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 96
Glu Ile Pro Met Gly Val Phe Asp Pro Ser Pro Gln Gly Lys Tyr Ser
20 25 30

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 144
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp
35 40 45

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 192
Thr Thr Ser Asn Ser Phe Val Arg Gln Gly Ser Ala Glu Ser Tyr Thr
50 55 60

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 240
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala
65 70 75 80

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 288
Leu Arg Lys Glu Ala Glu Arg Gln Ala Leu Ala Gln Leu Glu Lys Ala
85 90 95

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 336
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn
100 105 110

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384
Pro Ser Pro Gly Asp Glu Val Pro Val Gln Gly Val Ala Ile Thr Phe
115 120 125

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432
Glu Pro Lys Asp Phe Leu His Ile Lys Glu Lys Tyr Asn Asn Asp Trp
130 135 140

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480
Trp Ile Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe Ile Pro
145 150 155 160

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gln Glu Gln Lys Leu
165 170 175

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576
Arg Gln Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser
180 185 190

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala
195 200 205

AGT GCC AAA CAG AAG CAG AAG TCG ACA GAG CAT GTG CCC CCC TAT GAC 672
Ser Ala Lys Gln Lys Gln Lys Ser Thr Glu His Val Pro Pro Tyr Asp
210 215 220

GTG GTG CCT TCC ATG AGG CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG 720
Val Val Pro Ser Met Arg Pro Ile Ile Leu Val Gly Pro Ser Leu Lys
225 230 235 240

GGC TAC GAG GTT ACA GAC ATG ATG CAG AAA GCT TTA TTT GAC TTC TTG 768
Gly Tyr Glu Val Thr Asp Met Met Gln Lys Ala Leu Phe Asp Phe Leu
245 250 255

AAG CAT CGG TTT GAT GGC AGG ATC TCC ATC ACT CGT GTG ACG GCA GAT 816
Lys His Arg Phe Asp Gly Arg Ile Ser Ile Thr Arg Val Thr Ala Asp
260 265 270

ATT TCC CTG GCT AAG CGC TCA GTT CTC AAC AAC CCC AGC AAA CAC ATC 864
Ile Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His Ile
275 280 285

ATC ATT GAG CGC TCC AAC ACA CGC TCC AGC CTG GCT GAG GTG CAG AGT 912
Ile Ile Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gln Ser
290 295 300

GAA ATC GAG CGA ATC TTC GAG CTG GCC CGG ACC CTT CAG TTG GTC GCT 960
Glu Ile Glu Arg Ile Phe Glu Leu Ala Arg Thr Leu Gln Leu Val Ala
305 310 315 320

CTG GAT GCT GAC ACC ATC AAT CAC CCA GCC CAG CTG TCC AAG ACC TCG 1008
Leu Asp Ala Asp Thr Ile Asn His Pro Ala Gln Leu Ser Lys Thr Ser
325 330 335

CTG GCC CCC ATC ATT GTT TAC ATC AAG ATC ACC TCT CCC AAG GTA CTT 1056
Leu Ala Pro Ile Ile Val Tyr Ile Lys Ile Thr Ser Pro Lys Val Leu
340 345 350

CAA AGG CTC ATC AAG TCC CGA GGA AAG TCT CAG TCC AAA CAC CTC AAT 1104
Gln Arg Leu Ile Lys Ser Arg Gly Lys Ser Gln Ser Lys His Leu Asn
355 360 365

GTC CAA ATA GCG GCC TCG GAA AAG CTG GCA CAG TGC CCC CCT GAA ATG 1152
Val Gln Ile Ala Ala Ser Glu Lys Leu Ala Gln Cys Pro Pro Glu Met
370 375 380

TTT GAC ATC ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT 1200
Phe Asp Ile Ile Leu Asp Glu Asn Gln Leu Glu Asp Ala Cys Glu His
385 390 395 400

CTG GCG GAG TAC TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGC 1248
Leu Ala Glu Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser
405 410 415

AGC ACG CCA CCC AAT CCG CTG CTG AAC CGC ACC ATG GCT ACC GCA GCC 1296
Ser Thr Pro Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala
420 425 430

CTG GCT GCC AGC CCT GCC CCT GTC TCC AAC CTC CAG GGA CCC TAC CTT 1344
Leu Ala Ala Ser Pro Ala Pro Val Ser Asn Leu Gln Gly Pro Tyr Leu
435 440 445

GCT TCC GGG GAC CAG CCA CTG GAA CGG GCC ACC GGG GAG CAC GCC AGC 1392
Ala Ser Gly Asp Gln Pro Leu Glu Arg Ala Thr Gly Glu His Ala Ser
450 455 460

ATG CAC GAG TAC CCA GGG GAG CTG GGC CAG CCC CCA GGC CTT TAC CCC 1440
Met His Glu Tyr Pro Gly Glu Leu Gly Gln Pro Pro Gly Leu Tyr Pro
465 470 475 480

AGC AGC CAC CCA CCA GGC CGG GCA GGC ACG CTA CGG GCA CTG TCC CGC 1488
Ser Ser His Pro Pro Gly Arg Ala Gly Thr Leu Arg Ala Leu Ser Arg
485 490 495

CAA GAC ACT TTT GAT GCC GAC ACC CCC GGC AGC CGA AAC TCT GCC TAC 1536
Gln Asp Thr Phe Asp Ala Asp Thr Pro Gly Ser Arg Asn Ser Ala Tyr
500 505 510

ACG GAG CTG GGA GAC TCA TGT GTG GAC ATG GAG ACT GAC CCC TCA GAG 1584
Thr Glu Leu Gly Asp Ser Cys Val Asp Met Glu Thr Asp Pro Ser Glu
515 520 525

GGG CCA GGG CTT GGA GAC CCT GCA GGG GGC GGC ACG CCC CCA GCC CGA 1632
Gly Pro Gly Leu Gly Asp Pro Ala Gly Gly Gly Thr Pro Pro Ala Arg
530 535 540

CAG GGA TCC TGG GAG GAC GAG GAA GAA GAC TAT GAG GAA GAG CTG ACC 1680
Gln Gly Ser Trp Glu Asp Glu Glu Glu Asp Tyr Glu Glu Glu Leu Thr
545 550 555 560

GAC AAC CGG AAC CGG GGC CGG AAT AAG GCC CGC TAC TGC GCT GAG GGT 1728
Asp Asn Arg Asn Arg Gly Arg Asn Lys Ala Arg Tyr Cys Ala Glu Gly
565 570 575

GGG GGT CCA GTT TTG GGG CGC AAC AAG AAT GAG CTG GAG GGC TGG GGA 1776
Gly Gly Pro Val Leu Gly Arg Asn Lys Asn Glu Leu Glu Gly Trp Gly
580 585 590

CGA GGC GTC TAC ATT CGC TGAGAGGCAG GGGCCACACG GCGGGAGGAA 1824
Arg Gly Val Tyr Ile Arg
595

GGGCTCTGAG CCCAGGGGAG GGGAGGG 1851

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3600 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 35..3310

(D) OTHER INFORMATION: /standard_name= "Alpha-2"

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..34

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 3308..3600

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 52
Met Ala Ala Gly Cys Leu
1 5

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100
Leu Ala Leu Thr Leu Thr Leu Phe Gln Ser Leu Leu Ile Gly Pro Ser
10 15 20

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 148
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr Ile Lys Ser Trp Val Asp
25 30 35

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 196
Lys Met Gln Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val
40 45 50

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 244
Asn Gln Leu Val Asp Ile Tyr Glu Lys Tyr Gln Asp Leu Tyr Thr Val
55 60 65 70

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 292
Glu Pro Asn Asn Ala Arg Gln Leu Val Glu Ile Ala Ala Arg Asp Ile
75 80 85

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 340
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu
90 95 100

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388
Glu Ala Glu Lys Val Gln Ala Ala His Gln Trp Arg Glu Asp Phe Ala
105 110 115

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 436
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu
120 125 130

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 484
Lys Asn Asp Ser Glu Pro Gly Ser Gln Arg Ile Lys Pro Val Phe Ile
135 140 145 150

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532
Glu Asp Ala Asn Phe Gly Arg Gln Ile Ser Tyr Gln His Ala Ala Val
155 160 165

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580
His Ile Pro Thr Asp Ile Tyr Glu Gly Ser Thr Ile Val Leu Asn Glu
170 175 180

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 628
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu
185 190 195

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676
Glu Asp Pro Ser Leu Leu Trp Gln Val Phe Gly Ser Ala Thr Gly Leu
200 205 210

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro
215 220 225 230

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772
Asn Lys Ile Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr Ile Gln
235 240 245

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 820
Gly Ala Ala Ser Pro Lys Asp Met Leu Ile Leu Val Asp Val Ser Gly
250 255 260

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868
Ser Val Ser Gly Leu Thr Leu Lys Leu Ile Arg Thr Ser Val Ser Glu
265 270 275

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 916
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe
280 285 290

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 964
Asn Ser Asn Ala Gln Asp Val Ser Cys Phe Gln His Leu Val Gln Ala
295 300 305 310

[illegible] AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012
[illegible] Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn Ile Thr
315 320 325

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060
Ala Lys Gly Ile Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu
330 335 340

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108
Gln Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys Ile Ile
345 350 355

[codon and residue rows illegible in source] 1156
360 365 370

[codon and residue rows illegible in source] 1204
375 380 385 390

[codon and residue rows illegible in source] 1252
395 400 405

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300
Lys Gly Tyr Tyr Tyr Glu Ile Pro Ser Ile Gly Ala Ile Arg Ile Asn
410 415 420

[illegible] GAT GTT [illegible] GGA CCA ATG GTT TTA GCA GGA 1348
Thr Gln Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly
425 430 435

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT [illegible] 1396
Asp Lys Ala Lys Gln Val Gln Trp Thr Asn Val Tyr Leu Asp [illegible]
440 445 450

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 1444
Glu Leu Gly Leu Val Ile Thr Gly Thr Leu Pro Val Phe Asn Ile Thr
455 460 465 470

[illegible] TTA AAG AAC CAG CTG ATT CTT GGT 1492
Gly Gln Phe Glu Asn Lys Thr Asn Leu Lys Asn Gln Leu Ile Leu Gly
475 480 485

[illegible] GAT [illegible] TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 1540
Val Met Gly Val Asp Val Ser Leu Glu Asp Ile Lys Arg Leu Thr Pro
490 495 500

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala Ile Asp Pro Asn
505 510 515

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG AAC CCC AAA TCT 1636
Gly Tyr Val Leu Leu His Pro Asn Leu Gln Pro Lys Asn Pro Lys Ser
520 525 530

CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT 1684
Gln Glu Pro Val Thr Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp
535 540 545 550

ATT AAA GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA 1732
Ile Lys Val Glu Ile Arg Asn Lys Met Ile Asp Gly Glu Ser Gly Glu
555 560 565

[illegible] AGA [illegible] CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC 1780
Lys Thr Phe Arg Thr Leu Val Lys Ser Gln Asp Glu Arg Tyr Ile Asp
570 575 580

AAA GGA AAC AGG ACA TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC 1828
Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr
585 590 595

[illegible] TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC 1876
[illegible] Leu Pro Thr Tyr Ser Phe Tyr Tyr Ile Lys Ala
600 605 610

[illegible] ACT [illegible] GCC AGA TCA [illegible] AAA ATG 1924
Lys Leu Glu Glu Thr Ile Thr Gln Ala Arg Ser Lys Lys Gly Lys Met
615 620 625 630

AAG GAT TCG GAA ACC CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT 1972
Lys Asp Ser Glu Thr Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tyr
635 640 645

ACA TTC ATA GCA CCA AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT 2020
Thr Phe Ile Ala Pro Arg Asp Tyr Cys Asn Asp Leu Lys Ile Ser Asp
650 655 660

AAT AAC ACT GAA TTT CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA 2068
Asn Asn Thr Glu Phe Leu Leu Asn Phe Asn Glu Phe Ile Asp Arg Lys
665 670 675

ACT CCA AAC AAC CCA TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG 2116
Thr Pro Asn Asn Pro Ser Cys Asn Ala Asp Leu Ile Asn Arg Val Leu
680 685 690

CTT GAT GCA GGC TTT ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG 2164
Leu Asp Ala Gly Phe Thr Asn Glu Leu Val Gln Asn Tyr Trp Ser Lys
695 700 705 710

CAG AAA AAT ATC AAG GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT 2212
Gln Lys Asn Ile Lys Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly
715 720 725

GGG ATT ACC AGA GTT TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA 2260
Gly Ile Thr Arg Val Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gln Glu
730 735 740

AAC CCA GAG ACA TAT GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT AAT 2308
Asn Pro Glu Thr Tyr Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn
745 750 755

GAT AAC TAT GTT TTC ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT 2356
Asp Asn Tyr Val Phe Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly
760 765 770

GCC TAT GAA TCG GGC ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT 2404
Ala Tyr Glu Ser Gly Ile Met Val Ser Lys Ala Val Glu Ile Tyr Ile
775 780 785 790

CAA GGG AAA CTT CTT AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA 2452
Gln Gly Lys Leu Leu Lys Pro Ala Val Val Gly Ile Lys Ile Asp Val
795 800 805

AAT TCC TGG ATA GAG AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT 2500
Asn Ser Trp Ile Glu Asn Phe Thr Lys Thr Ser Ile Arg Asp Pro Cys
810 815 820

GCT GGT CCA GTT TGT GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT 2548
Ala Gly Pro Val Cys Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys
825 830 835

GTG ATT CTG GAT GAT GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT 2596
Val Ile Leu Asp Asp Gly Gly Phe Leu Leu Met Ala Asn His Asp Asp
840 845 850

[codon and residue rows illegible in source] 2644
855 860 865 870

[codon and residue rows illegible in source] 2692
875 880 885

[codon and residue rows illegible in source] 2740
890 895 900

[codon and residue rows illegible in source] 2788
905 910 915

[illegible] GCC [illegible] TCT ATT CTA CAG [illegible] 2836
[illegible] Ala Trp Ser Ile Leu Gln [illegible] Phe Leu
920 925 930

[codon and residue rows illegible in source] 2884
935 940 945 950

[illegible] AGG [illegible] ATT ACT GAA CAA 2932
[illegible] Gln Ser Cys Ile Thr Glu
955 960 965

[illegible] AAC GAC AGT TCA TTC ACT GGT GTA TTA 2980
Thr Gln Tyr Phe Phe Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu
970 975 980

[illegible] TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC 3028
Asp Cys Gly Asn Cys Ser Arg Ile Phe His Gly Glu Lys Leu Met Asn
985 990 995

[illegible] ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT 3076
[illegible] Ile Phe Ile Met Val Glu Ser Lys Gly Thr Cys Pro Cys
1000 1005 1010

GAT ACA CGA CTG CTC ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT 3124
Asp Thr Arg Leu Leu Ile Gln Ala Glu Gln Thr Ser Asp Gly Pro Asn
1015 1020 1025 1030

[illegible] GAC ATG [illegible] CCT AGA TAC CGA AAA GGG CCT GAT GTC 3172
Pro Cys Asp Met Val Lys Gln Pro Arg Tyr Arg Lys Gly Pro Asp Val
1035 1040 1045

[illegible] GAG GAT TAT ACT GAC [illegible] GGT GGT GTT 3220
Cys Phe Asp Asn Asn Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val
1050 1055 1060

[illegible] CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA 3268
Ser Gly Leu Asn Pro Ser Leu Trp Tyr Ile Ile Gly Ile Gln Phe Leu
1065 1070 1075

CTA CTT TGG CTG GTA TCT GGC AGC ACA CAC CGG CTG TTA TGACCTTCTA 3317 
Leu Leu Trp Leu Val Ser Gly Ser Thr His Arg Leu Leu 
1080 1085 1090 

AAAACCAAAT CTGCATAGTT AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT 3377 

TACAGTAACG TAGGGTCAGC TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC 3437 

ATAACACTAA GGCGCAGACT CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC 3497

TTAAACGTGT GTGAATGCTG CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGTG 3557

TCCTCTATTG GAAAATTTGG GCGTTTGTTG TTGCATTGTT GGT 3600 
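
The residue rows in this listing can be checked against their codon rows with the standard genetic code. The sketch below is only an illustrative check, not part of the listing: it assumes the standard nuclear code, builds the codon table from the conventional TCAG ordering, and uses a hypothetical helper name, `translate`. The example input is the final coding row of SEQ ID NO: 11 with the TGA stop codon that opens the 3' untranslated region.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter residues.

    A stop codon is reported as '*'; trailing bases that do not fill a
    complete codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


last_row = "CTA CTT TGG CTG GTA TCT GGC AGC ACA CAC CGG CTG TTA TGA"
print(translate(last_row))  # LLWLVSGSTHRLL*
```

The one-letter output corresponds to Leu Leu Trp Leu Val Ser Gly Ser Thr His Arg Leu Leu followed by a stop, which matches the residue row printed for positions 1079 to 1091.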

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

CCCCCTGCCA GTGGCCAAAC AGAAGCAGAA GTCGGGTAAT GAAATGACTA ACTTAGCCTT 60
TGAACTAGAC CCCCTAGAGT TAGAGGAGGA AGAGGCTGAG CTTGGTGAGC AGAGTGGCTC 120
TGCCAAGACT AGTGTTAGCA GTGTCACCAC CCCGCCACCC CATGGCAAAC GCATCCCCTT 180
CTTTAAGAAG ACAGAGCATG TGCCCCCCTA TGACGTGGTG CCTTCCATGA GGCCCATCAT 240
CCTGGTGGGA CCGTCGCTCA AGGGCTACGA GGTTACAGAC ATGATGCAGA AAGCTTTATT 300
TGACTTCTTG AAGCATCGGT TTG 323
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

CCTATTGGTG TAGGTATACC AACAATTAAT TTAAGAAAAA GGAGACCCAA TATCCAG 57
(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 180 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..132

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

[illegible] CTC CTC [illegible] CTC [illegible] 48
[illegible] Ala Cys Ala Ala Phe Ile Leu Leu Phe Leu Gly
1 5 10 15

[codon and residue rows illegible in source] 96
20 25 30

[codon row illegible in source] 149
[illegible] His [illegible]
35 40

CGACCCTCAG GCTTCTTCCC AGGAAGCGGG G 180

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid;

(A) DESCRIPTION: Oligonucleotide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

AATTCGGTAC GTACACTCGA GC 22

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid;

(A) DESCRIPTION: Oligonucleotide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

GCTCGAGTGT ACGTACCG 18
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCATGGTACC TTCGTTGACG 20 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

AATTCGTCAA CGAAGGTACC ATGG 24 
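
SEQ ID NOs: 15 to 18 are short single-stranded oligonucleotides. A quick check, which is not stated in the listing itself, shows that SEQ ID NO: 16 is the reverse complement of nucleotides 5 to 22 of SEQ ID NO: 15, and that SEQ ID NO: 17 is the reverse complement of nucleotides 5 to 24 of SEQ ID NO: 18, so each pair could anneal and leave a four-base 5' AATT overhang. The sketch below reproduces that check; the function name `reverse_complement` is illustrative only.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


SEQ15 = "AATTCGGTACGTACACTCGAGC"
SEQ16 = "GCTCGAGTGTACGTACCG"
SEQ17 = "CCATGGTACCTTCGTTGACG"
SEQ18 = "AATTCGTCAACGAAGGTACCATGG"

# Drop the first four bases (AATT) of the longer strand before comparing.
print(reverse_complement(SEQ15[4:]) == SEQ16)  # True
print(reverse_complement(SEQ18[4:]) == SEQ17)  # True
```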

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 53..1504

(D) OTHER INFORMATION: /standard_name= "Beta-3-1"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCGCCTCGGA CCCCCTGTCC CGGGGGAGGG GGAGAGCCCG CTACCCTGGT CT ATG 55
Met
1

TCT TTT TCT GAC TCC AGT GCA ACC TTC CTG CTG AAC GAG GGT TCA GCC 103
Ser Phe Ser Asp Ser Ser Ala Thr Phe Leu Leu Asn Glu Gly Ser Ala
5 10 15

[illegible] CCA TCT CTG GAC TCA GAC GTC TCG CTG GAG 151
Asp Ser Tyr Thr Ser Arg Pro Ser Leu Asp Ser Asp Val Ser Leu Glu
20 25 30

GAG GAC CGG GAG AGT GCC CGG CGT GAA GTA GAG AGC CAG GCT CAG [illegible] 199
Glu Asp Arg Glu Ser Ala Arg Arg Glu Val Glu Ser Gln Ala Gln [illegible]
35 40 45

CAG CTC GAA AGG GCC AAG CAC AAA CCT GTG GCA TTT GCG GTG AGG ACC 247
Gln Leu Glu Arg Ala Lys His Lys Pro Val Ala Phe Ala Val Arg Thr
50 55 60 65

[illegible] TGT GGC GTA CTG GAT GAG GAG TGC CCA GTC CAG GGC 295
Asn Val Ser Tyr Cys Gly Val Leu Asp Glu Glu Cys Pro Val Gln Gly
70 75 80

[illegible] GCC [illegible] GAT TTT CTG CAC ATT AAA GAG AAG 343
Ser Gly Val Asn Phe Glu Ala Lys Asp Phe Leu His Ile Lys Glu Lys
85 90 95

[illegible] GAC [illegible] GGG CGG CTA GTG AAA GAG GGC GGG GAC 391
Tyr Ser Asn Asp Trp Trp Ile Gly Arg Leu Val Lys Glu Gly Gly Asp
100 105 110

[illegible] CCC AGC CCC CAG CGC CTG GAG AGC ATC CGG CTC AAA 439
Ile Ala Phe Ile Pro Ser Pro Gln Arg Leu Glu Ser Ile Arg Leu Lys
115 120 125

[illegible] AAG [illegible] AGG AGA TCT GGG AAC CCT TCC AGC CTG AGT GAC 487
Gln Glu Gln Lys Ala Arg Arg Ser Gly Asn Pro Ser Ser Leu Ser Asp
130 135 140 145

[illegible] CGA CGC TCC CCT CCG CCA TCT CTA GCC AAG CAG AAG CAA 535
Ile Gly Asn Arg Arg Ser Pro Pro Pro Ser Leu Ala Lys Gln Lys Gln
150 155 160

AAG CAG GCG GAA CAT GTT CCC CCG TAT GAC GTG GTG CCC TCC ATG CGG 583
Lys Gln Ala Glu His Val Pro Pro Tyr Asp Val Val Pro Ser Met Arg
165 170 175

[illegible] CTG GTG GGA CCC TCT CTG AAA GGT TAT GAG GTC ACA GAC 631
Pro Val Val Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp
180 185 190

[illegible] AAG GCT CTC TTC GAG TTC CTC AAA CAC AGA TTT GAT GGC 679
Met Met Gln Lys Ala Leu Phe Asp Phe Leu Lys His Arg Phe Asp Gly
195 200 205

AGG ATC TCC ATC ACC CGA GTC ACA GCC GAC CTC TCC CTG GCA AAG CGA 727
Arg Ile Ser Ile Thr Arg Val Thr Ala Asp Leu Ser Leu Ala Lys Arg
210 215 220 225

TCT GTG CTC AAC AAT CCG GGC AAG AGG ACC ATC ATT GAG CGC TCC TCT 775
Ser Val Leu Asn Asn Pro Gly Lys Arg Thr Ile Ile Glu Arg Ser Ser
230 235 240

GCC CGC TCC AGC ATT GCG GAA GTG CAG AGT GAG ATC GAG CGC ATA TTT 823
Ala Arg Ser Ser Ile Ala Glu Val Gln Ser Glu Ile Glu Arg Ile Phe
245 250 255

GAG CTG GCC AAA TCC CTG CAG CTA GTA GTG TTG GAC GCT GAC ACC ATC 871
Glu Leu Ala Lys Ser Leu Gln Leu Val Val Leu Asp Ala Asp Thr Ile
260 265 270

AAC CAC CCA GCA CAG CTG GCC AAG ACC TCG CTG GCC CCC ATC ATC GTC 919
Asn His Pro Ala Gln Leu Ala Lys Thr Ser Leu Ala Pro Ile Ile Val
275 280 285

TTT GTC AAA GTG TCC TCA CCA AAG GTA CTC CAG CGT CTC ATT CGC TCC 967
Phe Val Lys Val Ser Ser Pro Lys Val Leu Gln Arg Leu Ile Arg Ser
290 295 300 305

CGG GGG AAG TCA CAG ATG AAG CAC CTG ACC GTA CAG ATG ATG GCA TAT 1015
Arg Gly Lys Ser Gln Met Lys His Leu Thr Val Gln Met Met Ala Tyr
310 315 320

GAT AAG CTG GTT CAG TGC CCA CCG GAG TCA TTT GAT GTG ATT CTG GAT 1063
Asp Lys Leu Val Gln Cys Pro Pro Glu Ser Phe Asp Val Ile Leu Asp
325 330 335

GAG AAC CAG CTG GAG GAT GCC TGT GAG CAC CTG GCT GAG TAC CTG GAG 1111
Glu Asn Gln Leu Glu Asp Ala Cys Glu His Leu Ala Glu Tyr Leu Glu
340 345 350

GTT TAC TGG CGG GCC ACG CAC CAC CCA GCC CCT GGC CCC GGA CTT CTG 1159
Val Tyr Trp Arg Ala Thr His His Pro Ala Pro Gly Pro Gly Leu Leu
355 360 365

GGT CCT CCC AGT GCC ATC CCC GGA CTT CAG AAC CAG CAG CTG CTG GGG 1207
Gly Pro Pro Ser Ala Ile Pro Gly Leu Gln Asn Gln Gln Leu Leu Gly
370 375 380 385

GAG CGT GGC GAG GAG CAC TCC CCC CTT GAG CGG GAC AGC TTG ATG CCC 1255
Glu Arg Gly Glu Glu His Ser Pro Leu Glu Arg Asp Ser Leu Met Pro
390 395 400

TCT GAT GAG GCC AGC GAG AGC TCC CGC CAA GCC TGG ACA GGA TCT TCA 1303
Ser Asp Glu Ala Ser Glu Ser Ser Arg Gln Ala Trp Thr Gly Ser Ser
405 410 415

CAG CGT AGC TCC CGC CAC CTG GAG GAG GAC TAT GCA GAT GCC TAC CAG 1351
Gln Arg Ser Ser Arg His Leu Glu Glu Asp Tyr Ala Asp Ala Tyr Gln
420 425 430

GAC CTG TAC CAG CCT CAC CGC CAA CAC ACC TCG GGG CTG CCT AGT GCT 1399
Asp Leu Tyr Gln Pro His Arg Gln His Thr Ser Gly Leu Pro Ser Ala
435 440 445

AAC GGG CAT GAC CCC CAA GAC CGG CTT CTA GCC CAG GAC TCA GAA CAC 1447
Asn Gly His Asp Pro Gln Asp Arg Leu Leu Ala Gln Asp Ser Glu His
455 460 465

AAC CAC AGT GAC CGG AAC TGG CAG CGC AAC CGG CCT TGG CCC AAG GAT 1495
Asn His Ser Asp Arg Asn Trp Gln Arg Asn Arg Pro Trp Pro Lys Asp
470 475 480

AGC TAC TGACAGC CTCCTGCTGC CCTACCCTGG CAGGCACAGG 1543
Ser Tyr

CGCAGCTGGC TGGGGGGCCC ACTCCAGGCA GGGTGGCGTT AGACTGGCAT 1593
CAGGCTGGCA CTAGGCTCAG CCCCCAAAAC CCCCTGCCCA GCCCCAGCTT CAGGGCTGCC 1648
TGTGGTCCCA AGGTTCTGGG AGAAACAGGG GACCCCCTCA CCTCCTGGGC AGTGACCCCT 1708
ACTAGGCTCC CATTCCAGGT ACTAGCTGTG TGTTCTGCAC CCCTGGCACC TTCCTCTCCT 1768
CCCACACAGG AAGCTGCCCC ACTGGGCAGT GCCCTCAGGC CAGGATCCCC TTAGCAGGGT 1828
CCTTCCCACC AGACTCAGGG AAGGGATGCC CCATTAAAGT GACAAAAGGG TGGGTGTGGG 1888
CACCATGGCA TGAGGAAGAA ACAAGGTCCC TGAGCAGGCA CAAGTCCTGA CAGTCAAGGG 1948
ACTGCTTTGG CATCCAGGGC CTCCAGTCAC CTCACTGCCA TACATTAGAA ATGAGACAAT 2008
TCAAAGCCCC CCCAGGGTGG CACACCCATC TGTTGCTGGG GTGTGGCAGC CACATCCAAG 2068
ACTGGAGCAG CAGGCTGGCC ACGCTTGGGC CAGAGAGAGC TCACAGCTGA AGCTCTTGGA 2128
GGGAAGGGCT CTCCTCACCC AATCG 2153

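
The CDS coordinates given for SEQ ID NO: 19 (LOCATION: 53..1504) can be sanity-checked against the listing itself. The sketch below is an illustration only: `summarize_cds` and `LEADER_19` are hypothetical names, and the coordinates are assumed to be 1-based and inclusive, as is usual for this kind of feature table. It computes the CDS length and reading frame and confirms that the 5' leader printed before the first ATG puts that ATG at position 53.

```python
"""Sanity-check a CDS location against the printed sequence listing."""
from typing import NamedTuple


class CdsSummary(NamedTuple):
    length_nt: int   # number of nucleotides covered by the CDS
    codons: int      # number of complete codons in that span
    in_frame: bool   # True when the span is a whole number of codons


def summarize_cds(start: int, end: int) -> CdsSummary:
    """Summarize a CDS given 1-based inclusive coordinates (assumed convention)."""
    if start < 1 or end < start:
        raise ValueError("expected 1-based coordinates with start <= end")
    length = end - start + 1
    return CdsSummary(length, length // 3, length % 3 == 0)


# 5' leader of SEQ ID NO: 19 as printed in the listing (all bases before
# the first ATG); the grouping spaces are removed before counting.
LEADER_19 = (
    "CCGCCTCGGA CCCCCTGTCC CGGGGGAGGG GGAGAGCCCG CTACCCTGGT CT"
).replace(" ", "")

if __name__ == "__main__":
    summary = summarize_cds(53, 1504)
    print(summary)
    print("first base of ATG at position", len(LEADER_19) + 1)
```

For 53..1504 this gives 1452 nucleotides, a whole number of codons (484), and the leader is 52 bases long, consistent with an ATG at positions 53 to 55 as the running count on the first sequence line indicates.
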
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 51..1492

(D) OTHER INFORMATION: /product= "A Beta3 subunit of human calcium channel"

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
CGCCCCCGGC GCCGCTCGTT CCCCCGACCC GGACTCCCCC ATGTATGACG ACTCCTACGT 60
GCCCGGGTTT GAGGACTCGG AGGCGGTTTC AGCCGACTCC TACACCAGCC GCCCATCTCT 120
GGACTCAGAC GTCTCCCTGG AGGAGGACCG GGAGAGTGCC CGGCGTGAAG TAGAGAGCCA 180
GGCTCAGCAG CAGCTCGAAA GGGCCAAGCA CAAACCTGTG GCATTTGCGG TGAGGACCAA 240
TGTCAGCTAC TGTGGCGTAC TGGATGAGGA GTGCCCAGTC CAGGGCTCTG GAGTCAACTT 300
TGAGGCCAAA GATTTTCTGC ACATTAAAGA GAAGTACAGC AATGACTGGT GGATCGGGCG 360
GCTAGTGAAA GAGGGCGGGG ACATCGCCTT CATCCCCAGC CCCCAGCGCC TGGAGAGCAT 420
CCGGCTCAAA CAGGAGCAGA AGGCCAGGAG ATCTGGGAAC CCTTCCAGCC TGAGTGACAT 480
TGGCAACCGA CGCTCCCCTC CGCCATCTCT AGCCAAGCAG AAGCAAAAGC AGGCGGAACA 540
TGTTCCCCCG TATGACGTGG TGCCCTCCAT GCGGCCTGTG GTGCTGGTGG GACCCTCTCT 600
GAAAGGTTAT GAGGTCACAG ACATGATGCA GAAGGCTCTC TTCGACTTCC TCAAACACAG 660
ATTTGATGGC AGGATCTCCA TCACCCGAGT CACAGCCGAC GTCTCCCTGG CAAAGCGATC 720
TGTGCTCAAC AATCCGGGCA AGAGGACCAT CATTGAGCGC TCCTCTGCCC GCTCCAGCAT 780
TGCGGAAGTG CAGAGTGAGA TCGAGCGCAT ATTTGAGCTG GCCAAATCCC TGCAGCTAGT 840
AGTGTTGGAC GCTGACACCA TCAACCACCC AGCACAGCTG GCCAAGACCT CGCTGGCCCC 900
CATCATCGTC TTTGTCAAAG TGTCCTCACC AAAGGTACTC CAGCGTCTCA TTCGCTCCCG 960
GGGGAAGTCA CAGATGAAGC ACCTGACCGT ACAGATGATG GCATATGATA AGCTGGTTCA 1020
GTGCCCACCG GAGTCATTTG ATGTGATTCT GGATGAGAAC CAGCTGGAGG ATGCCTGTGA 1080
GCACCTGGCT GAGTACCTGG AGGTTTACTG GCGGGCCACG CACCACCCAG CCCCTGGCCC 1140


CGGACTTCTG GGTCCTCCCA GTGCCATCCC CGGACTTCAG AACCAGCAGC TGCTGGGGGA 1200


GCGTGGCGAG GAGCACTCCC CCCTTGAGCG GGACAGCTTG ATGCCCTCTG ATGAGGCCAG 1260
CGAGAGCTCC CGCCAAGCCT GGACAGGATC TTCACAGCGT AGCTCCCGCC ACCTGGAGGA 1320
GGACTATGCA GATGCCTACC AGGACCTGTA CCAGCCTCAC CGCCAACACA CCTCGGGGCT 1380
GCCTAGTGCT AACGGGCATG ACCCCCAAGA CCGGCTTCTA GCCCAGGACT CAGAACACAA 1440
CCACAGTGAC CGGAACTGGC AGCGCAACCG GCCTTGGCCC AAGGATAGCT ACTGACAGCC 1500
TCCTGCTGCC CTACCCTGGC AGGCACAGGC GCAGCTGGCT GGGGGGCCCA CTCCAGGCAG 1560
GGTGGCGTTA GACTGGCATC AGGCTGGCAC TAGGCTCAGC CCCCAAAACC CCCTGCCCAG 1620
CCCCAGCTTC AGGGCTGCCT GTGGTCCCAA GGTTCTGGGA GAAACAGGGG ACCCCCTCAC 1680
CTCCTGGGCA GTGACCCCTA CTAGGCTCCC ATTCCAGGTA CTAGCTGTGT GTTCTGCACC 1740
CCTGGCACCT TCCTCTCCTC CCACACAGGA AGCTGCCCCA CTGGGCAGTG CCCTCAGGCC 1800
AGGATCCCCT TAGCAGGGTC CTTCCCACCA GACTCAGGGA AGGGATGCCC CATTAAAGTG 1860
ACAAAAGGGT GGGTGTGGGC ACCATGGCAT GAGGAAGAAA CAAGGTCCCT GAGCAGGCAC 1920
AAGTCCTGAC AGTCAAGGGA CTGCTTTGGC ATCCAGGGCC TCCAGTCACC TCACTGCCAT 1980
ACATTAGAAA TGAGACAATT CAAAGCCCCC CCAGGGTGGC ACACCCATCT GTTGCTGGGG 2040
TGTGGCAGCC ACATCCAAGA CTGGAGCAGC AGGCTGGCCA CGCTTGGGCC AGAGAGAGCT 2100
CACAGCTGAA GCTCTTGGAG GGAAGGGCTC TCCTCACCCA ATCG 2144


(2) INFORMATION FOR SEQ ID NO: 21: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid;

(A) DESCRIPTION: Oligonucleotide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

CTCAGTACCA TCTCTGATAC CAGCCCCA 28 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 237..7769

(D) OTHER INFORMATION: /standard_name= "Alpha-1A-1" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GATGTCCCGA GCTGCTATCC CCGGCTCGGC CCGGGCAGCC GCCTTCTGAG CCCCCGACCC 60 

GAGGCGCCGA GCCGCCGCCG CCCGATGGGC TGGGCCGTGG AGCGTCTCCG CAGTCGTAGC 120 

TCCAGCCGCC GCGCTCCCAG CCCCGGCAGC CTCAGCATCA GCGGCGGCGG CGGCGGCGGC 180 

GGCGTCTTCC GCATCGTTCG CCGCAGCGTA ACCCGGAGCC CTTTGCTCTT TGCAGA 236 

ATG GCC CGC TTC GGA GAC GAG ATG CCG GCC CGC TAC GGG GGA GGA GGC 284
Met Ala Arg Phe Gly Asp Glu Met Pro Ala Arg Tyr Gly Gly Gly Gly
1 5 10 15

TCC GGG GCA GCC GCC GGG GTG GTC GTG GGC AGC GGA GGC GGG CGA GGA 332
Ser Gly Ala Ala Ala Gly Val Val Val Gly Ser Gly Gly Gly Arg Gly
20 25 30

GCC GGG GGC AGC CGG CAG GGC GGG CAG CCC GGG GCG CAA AGG ATG TAC 380
Ala Gly Gly Ser Arg Gln Gly Gly Gln Pro Gly Ala Gln Arg Met Tyr
35 40 45

AAG CAG TCA ATG GCG CAG AGA GCG CGG ACC ATG GCA CTC TAC AAC CCC 428
Lys Gln Ser Met Ala Gln Arg Ala Arg Thr Met Ala Leu Tyr Asn Pro
50 55 60

ATC CCC GTC CGA CAG AAC TGC CTC ACG GTT AAC CGG TCT CTC TTC CTC 476
Ile Pro Val Arg Gln Asn Cys Leu Thr Val Asn Arg Ser Leu Phe Leu
65 70 75 80

TTC AGC GAA GAC AAC GTG GTG AGA AAA TAC GCC AAA AAG ATC ACC GAA 524
Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys Lys Ile Thr Glu
85 90 95

TGG CCT CCC TTT GAA TAT ATG ATT TTA GCC ACC ATC ATA GCG AAT TGC 572
Trp Pro Pro Phe Glu Tyr Met Ile Leu Ala Thr Ile Ile Ala Asn Cys
100 105 110

ATC GTC CTC GCA CTG GAG CAG CAT CTG CCT GAT GAT GAC AAG ACC CCG 620
Ile Val Leu Ala Leu Glu Gln His Leu Pro Asp Asp Asp Lys Thr Pro
115 120 125

ATG TCT GAA CGG CTG GAT GAC ACA GAA CCA TAC TTC ATT GGA ATT TTT 668
Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe Ile Gly Ile Phe
130 135 140

TGT TTC GAG GCT GGA ATT AAA ATC ATT GCC CTT GGG TTT GCC TTC CAC 716
Cys Phe Glu Ala Gly Ile Lys Ile Ile Ala Leu Gly Phe Ala Phe His
145 150 155 160

AAA GGC TCC TAC TTG AGG AAT GGC TGG AAT GTC ATG GAC TTT GTG GTG 764
Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met Asp Phe Val Val
165 170 175

GTG CTA ACG GGC ATC TTG GCG ACA GTT GGG ACG GAG TTT GAC CTA CGG 812
Val Leu Thr Gly Ile Leu Ala Thr Val Gly Thr Glu Phe Asp Leu Arg
180 185 190

ACG CTG AGG GCA GTT CGA GTG CTG CGG CCG CTC AAG CTG GTG TCT GGA 860
Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys Leu Val Ser Gly
195 200 205

ATC CCA AGT TTA CAA GTC GTC CTG AAG TCG ATC ATG AAG GCG ATG ATC 908
Ile Pro Ser Leu Gln Val Val Leu Lys Ser Ile Met Lys Ala Met Ile
210 215 220

CCT TTG CTG CAG ATC GGC CTC CTC CTA TTT TTT GCA ATC CTT ATT TTT 956
Pro Leu Leu Gln Ile Gly Leu Leu Leu Phe Phe Ala Ile Leu Ile Phe
225 230 235 240

GCA ATC ATA GGG TTA GAA TTT TAT ATG GGA AAA TTT CAT ACC ACC TGC 1004
Ala Ile Ile Gly Leu Glu Phe Tyr Met Gly Lys Phe His Thr Thr Cys
245 250 255

TTT GAA GAG GGG ACA GAT GAC ATT CAG GGT GAG TCT CCG GCT CCA TGT 1052
Phe Glu Glu Gly Thr Asp Asp Ile Gln Gly Glu Ser Pro Ala Pro Cys
260 265 270

GGG ACA GAA GAG CCC GCC CGC ACC TGC CCC AAT GGG ACC AAA TGT CAG 1100
Gly Thr Glu Glu Pro Ala Arg Thr Cys Pro Asn Gly Thr Lys Cys Gln
275 280 285

CCC TAC TGG GAA GGG CCC AAC AAC GGG ATC ACT CAG TTC GAC AAC ATC 1148
Pro Tyr Trp Glu Gly Pro Asn Asn Gly Ile Thr Gln Phe Asp Asn Ile
290 295 300

CTG TTT GCA GTG CTG ACT GTT TTC CAG TGC ATA ACC ATG GAA GGG TGG 1196
Leu Phe Ala Val Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp
305 310 315 320

ACT GAT CTC CTC TAC AAT AGC AAC GAT GCC TCA GGG AAC ACT TGG AAC 1244
Thr Asp Leu Leu Tyr Asn Ser Asn Asp Ala Ser Gly Asn Thr Trp Asn
325 330 335

TGG TTG TAC TTC ATC CCC CTC ATC ATC ATC GGC TCC TTT TTT ATG CTG 1292
Trp Leu Tyr Phe Ile Pro Leu Ile Ile Ile Gly Ser Phe Phe Met Leu
340 345 350

AAC CTT GTG CTG GGT GTG CTG TCA GGG GAG TTT GCC AAA GAA AGG GAA 1340
Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys Glu Arg Glu
355 360 365

CGG GTG GAG AAC CGG CGG GCT TTT CTG AAG CTG AGG CGG CAA CAA CAG 1388
Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg Gln Gln Gln
370 375 380

ATT GAA CGT GAG CTC AAT GGG TAC ATG GAA TGG ATC TCA AAA GCA GAA 1436
Ile Glu Arg Glu Leu Asn Gly Tyr Met Glu Trp Ile Ser Lys Ala Glu
385 390 395 400

GAG GTG ATC CTC GCC GAG GAT GAA ACT GAC GGG GAG CAG AGG CAT CCC 1484
Glu Val Ile Leu Ala Glu Asp Glu Thr Asp Gly Glu Gln Arg His Pro
405 410 415

TTT GAT GGA GCT CTG CGG AGA ACC ACC ATA AAG AAA AGC AAG ACA GAT 1532
Phe Asp Gly Ala Leu Arg Arg Thr Thr Ile Lys Lys Ser Lys Thr Asp
420 425 430

TTG CTC AAC CCC GAA GAG GCT GAG GAT CAG CTG GCT GAT ATA GCC TCT 1580
Leu Leu Asn Pro Glu Glu Ala Glu Asp Gln Leu Ala Asp Ile Ala Ser
435 440 445

GTG GGT TCT CCC TTC GCC CGA GCC AGC ATT AAA AGT GCC AAG CTG GAG 1628
Val Gly Ser Pro Phe Ala Arg Ala Ser Ile Lys Ser Ala Lys Leu Glu
450 455 460

AAC TCG ACC TTT TTT CAC AAA AAG GAG AGG AGG ATG CGT TTC TAC ATC 1676
Asn Ser Thr Phe Phe His Lys Lys Glu Arg Arg Met Arg Phe Tyr Ile
465 470 475 480

CGC CGC ATG GTC AAA ACT CAG GCC TTC TAC TGG ACT GTA CTC AGT TTG 1724
Arg Arg Met Val Lys Thr Gln Ala Phe Tyr Trp Thr Val Leu Ser Leu
485 490 495

GTA GCT CTC AAC ACG CTG TGT GTT GCT ATT GTT CAC TAC AAC CAG CCC 1772
Val Ala Leu Asn Thr Leu Cys Val Ala Ile Val His Tyr Asn Gln Pro
500 505 510

GAG TGG CTC TCC GAC TTC CTT TAC TAT GCA GAA TTC ATT TTC TTA GGA 1820
Glu Trp Leu Ser Asp Phe Leu Tyr Tyr Ala Glu Phe Ile Phe Leu Gly
515 520 525

CTC TTT ATG TCC GAA ATG TTT ATA AAA ATG TAC GGG CTT GGG ACG CGG 1868
Leu Phe Met Ser Glu Met Phe Ile Lys Met Tyr Gly Leu Gly Thr Arg
530 535 540

CCT TAC TTC CAC TCT TCC TTC AAC TGC TTT GAC TGT GGG GTT ATC ATT 1916
Pro Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Cys Gly Val Ile Ile
545 550 555 560

GGG AGC ATC TTC GAG GTC ATC TGG GCT GTC ATA AAA CCT GGC ACA TCC 1964
Gly Ser Ile Phe Glu Val Ile Trp Ala Val Ile Lys Pro Gly Thr Ser
565 570 575

TTT GGA ATC AGC GTG TTA CGA GCC CTC AGG TTA TTG CGT ATT TTC AAA 2012
Phe Gly Ile Ser Val Leu Arg Ala Leu Arg Leu Leu Arg Ile Phe Lys
580 585 590

GTC ACA AAG TAC TGG GCA TCT CTC AGA AAC CTG GTC GTC TCT CTC CTC 2060
Val Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser Leu Leu
595 600 605

AAC TCC ATG AAG TCC ATC ATC AGC CTG TTG TTT CTC CTT TTC CTG TTC 2108
Asn Ser Met Lys Ser Ile Ile Ser Leu Leu Phe Leu Leu Phe Leu Phe
610 615 620

ATT GTC GTC TTC GCC CTT TTG GGA ATG CAA CTC TTC GGC GGC CAG TTT 2156
Ile Val Val Phe Ala Leu Leu Gly Met Gln Leu Phe Gly Gly Gln Phe
625 630 635 640

AAT TTC GAT GAA GGG ACT CCT CCC ACC AAC TTC GAT ACT TTT CCA GCA 2204
Asn Phe Asp Glu Gly Thr Pro Pro Thr Asn Phe Asp Thr Phe Pro Ala
645 650 655

GCA ATA ATG ACG GTG TTT CAG ATC CTG ACG GGC GAA GAC TGG AAC GAG 2252
Ala Ile Met Thr Val Phe Gln Ile Leu Thr Gly Glu Asp Trp Asn Glu
660 665 670

GTC ATG TAC GAC GGG ATC AAG TCT CAG GGG GGC GTG CAG GGC GGC ATG 2300
Val Met Tyr Asp Gly Ile Lys Ser Gln Gly Gly Val Gln Gly Gly Met
675 680 685

GTG TTC TCC ATC TAT TTC ATT GTA CTG ACG CTC TTT GGG AAC TAC ACC 2348
Val Phe Ser Ile Tyr Phe Ile Val Leu Thr Leu Phe Gly Asn Tyr Thr
690 695 700

CTC CTG AAT GTG TTC TTG GCC ATC GCT GTG GAC AAT CTG GCC AAC GCC 2396
Leu Leu Asn Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala Asn Ala
705 710 715 720

CAG GAG CTC ACC AAG GTG GAG GCG GAC GAG CAA GAG GAA GAA GAA GCA 2444
Gln Glu Leu Thr Lys Val Glu Ala Asp Glu Gln Glu Glu Glu Glu Ala
725 730 735

GCG AAC CAG AAA CTT GCC CTA CAG AAA GCC AAG GAG GTG GCA GAA GTG 2492
Ala Asn Gln Lys Leu Ala Leu Gln Lys Ala Lys Glu Val Ala Glu Val
740 745 750

AGT CCT CTG TCC GCG GCC AAC ATG TCT ATA GCT GTG AAA GAG CAA CAG 2540
Ser Pro Leu Ser Ala Ala Asn Met Ser Ile Ala Val Lys Glu Gln Gln
755 760 765

AAG AAT CAA AAG CCA GCC AAG TCC GTG TGG GAG CAG CGG ACC AGT GAG 2588
Lys Asn Gln Lys Pro Ala Lys Ser Val Trp Glu Gln Arg Thr Ser Glu
770 775 780

ATG CGA AAG CAG AAC TTG CTG GCC AGC CGG GAG GCC CTG TAT AAC GAA 2636
Met Arg Lys Gln Asn Leu Leu Ala Ser Arg Glu Ala Leu Tyr Asn Glu
785 790 795 800

ATG GAC CCG GAC GAG CGC TGG AAG GCT GCC TAC ACG CGG CAC CTG CGG 2684
Met Asp Pro Asp Glu Arg Trp Lys Ala Ala Tyr Thr Arg His Leu Arg
805 810 815

CCA GAC ATG AAG ACG CAC TTG GAC CGG CCG CTG GTG GTG GAC CCG CAG 2732
Pro Asp Met Lys Thr His Leu Asp Arg Pro Leu Val Val Asp Pro Gln
820 825 830

GAG AAC CGC AAC AAC AAC ACC AAC AAG AGC CGG GCG GCC GAG CCC ACC 2780
Glu Asn Arg Asn Asn Asn Thr Asn Lys Ser Arg Ala Ala Glu Pro Thr
835 840 845

GTG GAC CAG CGC CTC GGC CAG CAG CGC GCC GAG GAC TTC CTC AGG AAA 2828
Val Asp Gln Arg Leu Gly Gln Gln Arg Ala Glu Asp Phe Leu Arg Lys
850 855 860

CAG GCC CGC TAC CAC GAT CGG GCC CGG GAC CCC AGC GGC TCG GCG GGC 2876
Gln Ala Arg Tyr His Asp Arg Ala Arg Asp Pro Ser Gly Ser Ala Gly
865 870 875 880

CTG GAC GCA CGG AGG CCC TGG GCG GGA AGC CAG GAG GCC GAG CTG AGC 2924
Leu Asp Ala Arg Arg Pro Trp Ala Gly Ser Gln Glu Ala Glu Leu Ser
885 890 895

CGG GAG GGA CCC TAC GGC CGC GAG TCG GAC CAC CAC GCC CGG GAG GGC 2972
Arg Glu Gly Pro Tyr Gly Arg Glu Ser Asp His His Ala Arg Glu Gly
900 905 910

AGC CTG GAG CAA CCC GGG TTC TGG GAG GGC GAG GCC GAG CGA GGC AAG 3020
Ser Leu Glu Gln Pro Gly Phe Trp Glu Gly Glu Ala Glu Arg Gly Lys
915 920 925

GCC GGG GAC CCC CAC CGG AGG CAC GTG CAC CGG CAG GGG GGC AGC AGG 3068
Ala Gly Asp Pro His Arg Arg His Val His Arg Gln Gly Gly Ser Arg
930 935 940

GAG AGC CGC AGC GGG TCC CCG CGC ACG GGC GCG GAC GGG GAG CAT CGA 3116
Glu Ser Arg Ser Gly Ser Pro Arg Thr Gly Ala Asp Gly Glu His Arg
945 950 955 960

CGT CAT CGC GCG CAC CGC AGG CCC GGG GAG GAG GGT CCG GAG GAC AAG 3164
Arg His Arg Ala His Arg Arg Pro Gly Glu Glu Gly Pro Glu Asp Lys
965 970 975

GCG GAG CGG AGG GCG CGG CAC CGC GAG GGC AGC CGG CCG GCC CGG GGC 3212
Ala Glu Arg Arg Ala Arg His Arg Glu Gly Ser Arg Pro Ala Arg Gly
980 985 990

GGC GAG GGC GAG GGC GAG GGC CCC GAC GGG GGC GAG CGC AGG AGA AGG 3260
Gly Glu Gly Glu Gly Glu Gly Pro Asp Gly Gly Glu Arg Arg Arg Arg
995 1000 1005

CAC CGG CAT GGC GCT CCA GCC ACG TAC GAG GGG GAC GCG CGG AGG GAG 3308
His Arg His Gly Ala Pro Ala Thr Tyr Glu Gly Asp Ala Arg Arg Glu
1010 1015 1020

GAC AAG GAG CGG AGG CAT CGG AGG AGG AAA GAG AAC CAG GGC TCC GGG 3356
Asp Lys Glu Arg Arg His Arg Arg Arg Lys Glu Asn Gln Gly Ser Gly
1025 1030 1035 1040

GTC CCT GTG TCG GGC CCC AAC CTG TCA ACC ACC CGG CCA ATC CAG CAG 3404
Val Pro Val Ser Gly Pro Asn Leu Ser Thr Thr Arg Pro Ile Gln Gln
1045 1050 1055

GAC CTG GGC CGC CAA GAC CCA CCC CTG GCA GAG GAT ATT GAC AAC ATG 3452
Asp Leu Gly Arg Gln Asp Pro Pro Leu Ala Glu Asp Ile Asp Asn Met
1060 1065 1070

AAG AAC AAC AAG CTG GCC ACC GCG GAG TCG GCC GCT CCC CAC GGC AGC 3500
Lys Asn Asn Lys Leu Ala Thr Ala Glu Ser Ala Ala Pro His Gly Ser
1075 1080 1085

CTT GGC CAC GCC GGC CTG CCC CAG AGC CCA GCC AAG ATG GGA AAC AGC 3548
Leu Gly His Ala Gly Leu Pro Gln Ser Pro Ala Lys Met Gly Asn Ser
1090 1095 1100

ACC GAC CCC GGC CCC ATG CTG GCC ATC CCT GCC ATG GCC ACC AAC CCC 3596
Thr Asp Pro Gly Pro Met Leu Ala Ile Pro Ala Met Ala Thr Asn Pro
1105 1110 1115 1120

CAG AAC GCC GCC AGC CGC CGG ACG CCC AAC AAC CCG GGG AAC CCA TCC 3644
Gln Asn Ala Ala Ser Arg Arg Thr Pro Asn Asn Pro Gly Asn Pro Ser
1125 1130 1135

2 2 25 5 2 2 2 2 2 2 £ 2 2 !5 2 2 3692
1145 1150

Pro A fS ACC ACC AAT TCA GCT AAG ACT GCC AGG AAA CCC GAC 3740
Pro Ser Gly Thr Gln Thr Asn Ser Ala Lys Thr Ala Arg Lys Pro Asp
1155 1160 1165

SS £S 2 SI 2 SS S 2 S SS 2 2 Si »-
22 2 a S3 22 222 2 22 2 2 Si 3836
1195 1200

JJs* G A u o AG f* 3 ^ GAG GAG CAG GAA GAC GAC CGT GGG GAA GAC 3884
Lys Glu Glu Glu Lys Lys Glu Glu Glu Glu Asp Asp Arg Gly Glu Asp
1205 1210 1215

2 2 2 £5 2 2 Tyr 2 2 2 2 2 2 2 2 '~
1225 1230

2 2 S2 2 2 2 2 His 2 s a 2 £2 2 2 3980
1235 1240 1245

TTT GAG ATG TGC ATC CTC ATG GTC ATT GCC ATG AGC AGC ATC GCC rrr 4028
Phe Glu Met Cys Ile Leu Met Val Ile Ala Met Ser Ser S 111
1255 1260

^ G ^ CCT GTG CAG CCC AAC CCA CCT CGG AAC AAC GTG CTG 4076
Ala Ala Glu Asp Pro Val Gln Pro Asn Ala Pro Arg Asn Asn Val Leu
1270 1275 1280

CGA TAC TTT GAC TAC GTT TTT ACA GGC GTC TTC ACC TTT GAG ATG CTr 4124
Arg Tyr Phe Asp Tyr Val Phe Thr Gly Val Phe tS III Su vll
1285 1290 1295

111 iZt t? T GAC T CTG GGG CTC GTC CTG ^ °AG CGT GCC TAC TTC 4172
Ile Lys Met Ile Asp Leu Gly Leu Val Leu His Gln Gly Ala Tyr Phe
1300 1305 1310

CGT GAC CTC TGG AAT ATT CTC GAC TTC ATA GTG GTC AGT GGG GCC CTG 4220
Arg Asp Leu Trp Asn Ile Leu Asp Phe Ile Val Val Ser Gly Ala Leu
1315 1320 1325

f/I A GC ° IF G ? C ACT GGC AAT AGC AAA GGA AAA GAC ATC AAC ACG 4268
Val Ala Phe Ala Phe Thr Gly Asn Ser Lys Gly Lys Asp Ile Asn Thr
1335 1340

ATT AAA TCC CTC CGA GTC CTC CGG GTG CTA CGA CCT CTT AAA ACC ATC 4316
Ile Lys Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys Thr Ile
1345 1350 1355 1360

AAG CGG CTG CCA AAG CTC AAG GCT GTG TTT GAC TGT GTG GTG AAC TCA 4364
Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val Asn Ser
1365 1370 1375

CTT AAA AAC GTC TTC AAC ATC CTC ATC GTC TAC ATG CTA TTC ATG TTC 4412
Leu Lys Asn Val Phe Asn Ile Leu Ile Val Tyr Met Leu Phe Met Phe
1380 1385 1390

ATC TTC GCC GTG GTG GCT GTG CAG CTC TTC AAG GGG AAA TTC TTC CAC 4460
Ile Phe Ala Val Val Ala Val Gln Leu Phe Lys Gly Lys Phe Phe His
1395 1400 1405

TGC ACT GAC GAG TCC AAA GAG TTT GAG AAA GAT TGT CGA GGC AAA TAC 4508
Cys Thr Asp Glu Ser Lys Glu Phe Glu Lys Asp Cys Arg Gly Lys Tyr
1410 1415 1420

CTC CTC TAC GAG AAG AAT GAG GTG AAG GCG CGA GAC CGG GAG TGG AAG 4556
Leu Leu Tyr Glu Lys Asn Glu Val Lys Ala Arg Asp Arg Glu Trp Lys
1425 1430 1435 1440

AAG TAT GAA TTC CAT TAC GAC AAT GTG CTG TGG GCT CTG CTG ACC CTC 4604
Lys Tyr Glu Phe His Tyr Asp Asn Val Leu Trp Ala Leu Leu Thr Leu
1445 1450 1455

TTC ACC GTG TCC ACG GGA GAA GGC TGG CCA CAG GTC CTC AAG CAT TCG 4652
Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Gln Val Leu Lys His Ser
1460 1465 1470

GTG GAC GCC ACC TTT GAG AAC CAG GGC CCC AGC CCC GGG TAC CGC ATG 4700
Val Asp Ala Thr Phe Glu Asn Gln Gly Pro Ser Pro Gly Tyr Arg Met
1475 1480 1485

GAG ATG TCC ATT TTC TAC GTC GTC TAC TTT GTG GTG TTC CCC TTC TTC 4748
Glu Met Ser Ile Phe Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe
1490 1495 1500

TTT GTC AAT ATC TTT GTG GCC TTG ATC ATC ATC ACC TTC CAG GAG CAA 4796
Phe Val Asn Ile Phe Val Ala Leu Ile Ile Ile Thr Phe Gln Glu Gln
1505 1510 1515 1520

GGG GAC AAG ATG ATG GAG GAA TAC AGC CTG GAG AAA AAT GAG AGG GCC 4844
Gly Asp Lys Met Met Glu Glu Tyr Ser Leu Glu Lys Asn Glu Arg Ala
1525 1530 1535

TGC ATT GAT TTC GCC ATC AGC GCC AAG CCG CTG ACC CGA CAC ATG CCG 4892
Cys Ile Asp Phe Ala Ile Ser Ala Lys Pro Leu Thr Arg His Met Pro
1540 1545 1550

CAG AAC AAG CAG AGC TTC CAG TAC CGC ATG TGG CAG TTC GTG GTG TCT 4940
Gln Asn Lys Gln Ser Phe Gln Tyr Arg Met Trp Gln Phe Val Val Ser
1555 1560 1565

CCG CCT TTC GAG TAC ACG ATC ATG GCC ATG ATC GCC CTC AAC ACC ATC 4988
Pro Pro Phe Glu Tyr Thr Ile Met Ala Met Ile Ala Leu Asn Thr Ile
1570 1575 1580

GTG CTT ATG ATG AAG TTC TAT GGG GCT TCT GTT GCT TAT GAA AAT GCC 5036
Val Leu Met Met Lys Phe Tyr Gly Ala Ser Val Ala Tyr Glu Asn Ala
1585 1590 1595 1600

CTG CGG GTG TTC AAC ATC GTC TTC ACC TCC CTC TTC TCT CTG GAA TGT 5084
Leu Arg Val Phe Asn Ile Val Phe Thr Ser Leu Phe Ser Leu Glu Cys
1605 1610 1615

GTG CTG AAA GTC ATG GCT TTT GGG ATT CTG AAT TAT TTC CGC GAT GCC 5132
Val Leu Lys Val Met Ala Phe Gly Ile Leu Asn Tyr Phe Arg Asp Ala
1620 1625 1630

TGG AAC ATC TTC GAC TTT GTG ACT GTT CTG GGC AGC ATC ACC GAT ATC 5180
Trp Asn Ile Phe Asp Phe Val Thr Val Leu Gly Ser Ile Thr Asp Ile
1635 1640 1645

CTC GTG ACT GAG TTT GGG AAT CCG AAT AAC TTC ATC AAC CTG AGC TTT 5228
Leu Val Thr Glu Phe Gly Asn Pro Asn Asn Phe Ile Asn Leu Ser Phe
1650 1655 1660

CTC CGC CTC TTC CGA GCT GCC CGG CTC ATC AAA CTT CTC CGT CAG GGT 5276
Leu Arg Leu Phe Arg Ala Ala Arg Leu Ile Lys Leu Leu Arg Gln Gly
1665 1670 1675 1680

TAC ACC ATC CGC ATT CTT CTC TGG ACC TTT GTG CAG TCC TTC AAG GCC 5324
Tyr Thr Ile Arg Ile Leu Leu Trp Thr Phe Val Gln Ser Phe Lys Ala
1685 1690 1695

CTG CCT TAT GTC TGT CTG CTG ATC GCC ATG CTC TTC TTC ATC TAT GCC 5372
Leu Pro Tyr Val Cys Leu Leu Ile Ala Met Leu Phe Phe Ile Tyr Ala
1700 1705 1710

ATC ATT GGG ATG CAG GTG TTT GGT AAC ATT GGC ATC GAC GTG GAG GAC 5420
Ile Ile Gly Met Gln Val Phe Gly Asn Ile Gly Ile Asp Val Glu Asp
1715 1720 1725

GAG GAC AGT GAT GAA GAT GAG TTC CAA ATC ACT GAG CAC AAT AAC TTC 5468
Glu Asp Ser Asp Glu Asp Glu Phe Gln Ile Thr Glu His Asn Asn Phe
1730 1735 1740

CGG ACC TTC TTC CAG GCC CTC ATG CTT CTC TTC CGG AGT GCC ACC GGG 5516
Arg Thr Phe Phe Gln Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly
1745 1750 1755 1760

GAA GCT TGG CAC AAC ATC ATG CTT TCC TGC CTC AGC GGG AAA CCG TGT 5564
Glu Ala Trp His Asn Ile Met Leu Ser Cys Leu Ser Gly Lys Pro Cys
1765 1770 1775

GAT AAG AAC TCT GGC ATC CTG ACT CGA GAG TGT GGC AAT GAA TTT GCT 5612
Asp Lys Asn Ser Gly Ile Leu Thr Arg Glu Cys Gly Asn Glu Phe Ala
1780 1785 1790

TAT TTT TAC TTT GTT TCC TTC ATC TTC CTC TGC TCG TTT CTG ATG CTG 5660
Tyr Phe Tyr Phe Val Ser Phe Ile Phe Leu Cys Ser Phe Leu Met Leu
1795 1800 1805

AAT CTC TTT GTC GCC GTC ATC ATG GAC AAC TTT GAG TAC CTC ACC CGA 5708
Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Glu Tyr Leu Thr Arg
1810 1815 1820

GAC TCC TCC ATC CTG GGC CCC CAC CAC CTG GAT GAG TAC GTG CGT GTC 5756
Asp Ser Ser Ile Leu Gly Pro His His Leu Asp Glu Tyr Val Arg Val
1825 1830 1835 1840

TGG GCC GAG TAT GAC CCC GCA GCT TGG GGC CGC ATG CCT TAC CTG GAC 5804
Trp Ala Glu Tyr Asp Pro Ala Ala Trp Gly Arg Met Pro Tyr Leu Asp
1845 1850 1855

ATG TAT CAG ATG CTG AGA CAC ATG TCT CCG CCC CTG GGT CTG GGG AAG 5852
Met Tyr Gln Met Leu Arg His Met Ser Pro Pro Leu Gly Leu Gly Lys
1860 1865 1870

AAG TGT CCG GCC AGA GTG GCT TAC AAG CGG CTT CTG CGG ATG GAC CTG 5900
Lys Cys Pro Ala Arg Val Ala Tyr Lys Arg Leu Leu Arg Met Asp Leu
1875 1880 1885

CCC GTC GCA GAT GAC AAC ACC GTC CAC TTC AAT TCC ACC CTC ATG GCT 5948
Pro Val Ala Asp Asp Asn Thr Val His Phe Asn Ser Thr Leu Met Ala
1890 1895 1900

CTG ATC CGC ACA GCC CTG GAC ATC AAG ATT GCC AAG GGA GGA GCC GAC 5996
Leu Ile Arg Thr Ala Leu Asp Ile Lys Ile Ala Lys Gly Gly Ala Asp
1905 1910 1915 1920

AAA CAG CAG ATG GAC GCT GAG CTG CGG AAG GAG ATG ATG GCG ATT TGG 6044
Lys Gln Gln Met Asp Ala Glu Leu Arg Lys Glu Met Met Ala Ile Trp
1925 1930 1935

CCC AAT CTG TCC CAG AAG ACG CTA GAC CTG CTG GTC ACA CCT CAC AAG 6092
Pro Asn Leu Ser Gln Lys Thr Leu Asp Leu Leu Val Thr Pro His Lys
1940 1945 1950

TCC ACG GAC CTC ACC GTG GGG AAG ATC TAC GCA GCC ATG ATG ATC ATG 6140
Ser Thr Asp Leu Thr Val Gly Lys Ile Tyr Ala Ala Met Met Ile Met
1955 1960 1965

GAG TAC TAC CGG CAG AGC AAG GCC AAG AAG CTG CAG GCC ATG CGC GAG 6188
Glu Tyr Tyr Arg Gln Ser Lys Ala Lys Lys Leu Gln Ala Met Arg Glu
1970 1975 1980

GAG CAG GAC CGG ACA CCC CTC ATG TTC CAG CGC ATG GAG CCC CCG TCC 6236
Glu Gln Asp Arg Thr Pro Leu Met Phe Gln Arg Met Glu Pro Pro Ser
1985 1990 1995 2000

CCA ACG CAG GAA GGG GGA CCT GGC CAG AAC GCC CTC CCC TCC ACC CAG 6284
Pro Thr Gln Glu Gly Gly Pro Gly Gln Asn Ala Leu Pro Ser Thr Gln
2005 2010 2015

CTG GAC CCA GGA GGA GCC CTG ATG GCT CAC GAA AGC GGC CTC AAG GAG 6332
Leu Asp Pro Gly Gly Ala Leu Met Ala His Glu Ser Gly Leu Lys Glu
2020 2025 2030

AGC CCG TCC TGG GTG ACC CAG CGT GCC CAG GAG ATG TTC CAG AAG ACG 6380
Ser Pro Ser Trp Val Thr Gln Arg Ala Gln Glu Met Phe Gln Lys Thr
2035 2040 2045

GGC ACA TGG AGT CCG GAA CAA GGC CCC CCT ACC GAC ATG CCC AAC AGC 6428
Gly Thr Trp Ser Pro Glu Gln Gly Pro Pro Thr Asp Met Pro Asn Ser
2050 2055 2060

CAG CCT AAC TCT CAG TCC GTG GAG ATG CGA GAG ATG GGC AGA GAT GGC 6476
Gln Pro Asn Ser Gln Ser Val Glu Met Arg Glu Met Gly Arg Asp Gly
2065 2070 2075 2080

TAC TCC GAC AGC GAG CAC TAC CTC CCC ATG GAA GGC CAG GGC CGG GCT 6524
Tyr Ser Asp Ser Glu His Tyr Leu Pro Met Glu Gly Gln Gly Arg Ala
2085 2090 2095

GCC TCC ATG CCC CGC CTC CCT GCA GAG AAC CAG AGG AGA AGG GGC CGG 6572
Ala Ser Met Pro Arg Leu Pro Ala Glu Asn Gln Arg Arg Arg Gly Arg
2100 2105 2110

CCA CGT GGG AAT AAC CTC AGT ACC ATC TCA GAC ACC AGC CCC ATG AAG 6620
Pro Arg Gly Asn Asn Leu Ser Thr Ile Ser Asp Thr Ser Pro Met Lys
2115 2120 2125

CGT TCA GCC TCC GTG CTG GGC CCC AAG GCC CGA CGC CTG GAC GAT TAC 6668
Arg Ser Ala Ser Val Leu Gly Pro Lys Ala Arg Arg Leu Asp Asp Tyr
2130 2135 2140

TCG CTG GAG CGG GTC CCG CCC GAG GAG AAC CAG CGG CAC CAC CAG CGG 6716
Ser Leu Glu Arg Val Pro Pro Glu Glu Asn Gln Arg His His Gln Arg
2145 2150 2155 2160

CGC CGC GAC CGC AGC CAC CGC GCC TCT GAG CGC TCC CTG GGC CGC TAC 6764
Arg Arg Asp Arg Ser His Arg Ala Ser Glu Arg Ser Leu Gly Arg Tyr
2165 2170 2175

ACC GAT GTG GAC ACA GGC TTG GGG ACA GAC CTG AGC ATG ACC ACC CAA 6812
Thr Asp Val Asp Thr Gly Leu Gly Thr Asp Leu Ser Met Thr Thr Gln
2180 2185 2190

TCC GGG GAC CTG CCG TCG AAG GAG CGG GAC CAG GAG CGG GGC CGG CCC 6860
Ser Gly Asp Leu Pro Ser Lys Glu Arg Asp Gln Glu Arg Gly Arg Pro
2195 2200 2205

AAG GAT CGG AAG CAT CGA CAG CAC CAC CAC CAC CAC CAC CAC CAC CAC 6908
Lys Asp Arg Lys His Arg Gln His His His His His His His His His
2210 2215 2220

CAT CCC CCG CCC CCC GAC AAG GAC CGC TAT GCC CAG GAA CGG CCG GAC 6956
His Pro Pro Pro Pro Asp Lys Asp Arg Tyr Ala Gln Glu Arg Pro Asp
2225 2230 2235 2240

CAC GGC CGG GCA CGG GCT CGG GAC CAG CGC TGG TCC CGC TCG CCC AGC 7004
His Gly Arg Ala Arg Ala Arg Asp Gln Arg Trp Ser Arg Ser Pro Ser
2245 2250 2255

GAG GGC CGA GAG CAC ATG GCG CAC CGG CAG GGC AGT AGT TCC GTA AGT 7052 
Glu Gly Arg Glu His Met Ala His Arg Gln Gly Ser Ser Ser Val Ser
2260 2265 2270 

GGA AGC CCA GCC CCC TCA ACA TCT GGT ACC AGC ACT CCG CGG CGG GGC 7100 
Gly Ser Pro Ala Pro Ser Thr Ser Gly Thr Ser Thr Pro Arg Arg Gly 
2275 2280 2285 

CGC CGC CAG CTC CCC CAG ACC CCC TCC ACC CCC CGG CCA CAC GTG TCC 7148
Arg Arg Gln Leu Pro Gln Thr Pro Ser Thr Pro Arg Pro His Val Ser
2290 2295 2300 

TAT TCC CCT GTG ATC CGT AAG GCC GGC GGC TCG GGG CCC CCG CAG CAG 7196 
Tyr Ser Pro Val Ile Arg Lys Ala Gly Gly Ser Gly Pro Pro Gln Gln
2305 2310 2315 2320 

CAG CAG CAG CAG CAG CAG CAG CAG CAG GCG GTG GCC AGG CCG GGC CGG 7244 
Gln Gln Gln Gln Gln Gln Gln Gln Gln Ala Val Ala Arg Pro Gly Arg
2325 2330 2335 

GCG GCC ACC AGC GGC CCT CGG AGG TAC CCA GGC CCC ACG GCC GAG CCT 7292 
Ala Ala Thr Ser Gly Pro Arg Arg Tyr Pro Gly Pro Thr Ala Glu Pro 
2340 2345 2350 

CTG GCC GGA GAT CGG CCG CCC ACG GGG GGC CAC AGC AGC GGC CGC TCG 7340
Leu Ala Gly Asp Arg Pro Pro Thr Gly Gly His Ser Ser Gly Arg Ser
2355 2360 2365 

CCC AGG ATG GAG AGG CGG GTC CCA GGC CCG GCC CGG AGC GAG TCC CCC 7388
Pro Arg Met Glu Arg Arg Val Pro Gly Pro Ala Arg Ser Glu Ser Pro
2370 2375 2380 

AGG GCC TGT CGA CAC GGC GGG GCC CGG TGG CCG GCA TCT GGC CCG CAC 7436
Arg Ala Cys Arg His Gly Gly Ala Arg Trp Pro Ala Ser Gly Pro His
2385 2390 2395 2400

GTG TCC GAG GGG CCC CCG GGT CCC CGG CAC CAT GGC TAC TAC CGG GGC 7484
Val Ser Glu Gly Pro Pro Gly Pro Arg His His Gly Tyr Tyr Arg Gly
2405 2410 2415 

TCC GAC TAC GAC GAG GCC GAT GGC CCG GGC AGC GGG GGC GGC GAG GAG 7532 
Ser Asp Tyr Asp Glu Ala Asp Gly Pro Gly Ser Gly Gly Gly Glu Glu 
2420 2425 2430 

GCC ATG GCC GGG GCC TAC GAC GCG CCA CCC CCC GTA CGA CAC GCG TCC 7580
Ala Met Ala Gly Ala Tyr Asp Ala Pro Pro Pro Val Arg His Ala Ser
2435 2440 2445 

TCG GGC GCC ACC GGG CGC TCG CCC AGG ACT CCC CGG GCC TCG GGC CCG 7628
Ser Gly Ala Thr Gly Arg Ser Pro Arg Thr Pro Arg Ala Ser Gly Pro
2450 2455 2460 

GCC TGC GCC TCG CCT TCT CGG CAC GGC CGG CGA CTC CCC AAC GGC TAC 7676
Ala Cys Ala Ser Pro Ser Arg His Gly Arg Arg Leu Pro Asn Gly Tyr
2465 2470 2475 2480

TAC CCG GCG CAC GGA CTG GCC AGG CCC CGC GGG CCG GGC TCC AGG AAG 7724 
Tyr Pro Ala His Gly Leu Ala Arg Pro Arg Gly Pro Gly Ser Arg Lys 
2485 2490 2495

GGC CTG CAC GAA CCC TAC AGC GAG AGT GAC GAT GAT TGG TGC TAAGCCCGGG 7776 
Gly Leu His Glu Pro Tyr Ser Glu Ser Asp Asp Asp Trp Cys 
2500 2505 2510 

CGAGGTGGCG CCCGCCCGGC CCCCCACGCA CC 7808 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 237..7037

(D) OTHER INFORMATION: /standard_name= "Alpha-1A-2"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GATGTCCCGA GCTGCTATCC CCGGCTCGGC CCGGGCAGCC GCCTTCTGAG CCCCCGACCC 60 

GAGGCGCCGA GCCGCCGCCG CCCGATGGGC TGGGCCGTGG AGCGTCTCCG CAGTCGTAGC 120 

TCCAGCCGCC GCGCTCCCAG CCCCGGCAGC CTCAGCATCA GCGGCGGCGG CGGCGGCGGC 180 

GGCGTCTTCC GCATCGTTCG CCGCAGCGTA ACCCGGAGCC CTTTGCTCTT TGCAGA 236 

ATG GCC CGC TTC GGA GAC GAG ATG CCG GCC CGC TAC GGG GGA GGA GGC 284 
Met Ala Arg Phe Gly Asp Glu Met Pro Ala Arg Tyr Gly Gly Gly Gly 
1 5 10 15

TCC GGG GCA GCC GCC GGG GTG GTC GTG GGC AGC GGA GGC GGG CGA GGA 332 
Ser Gly Ala Ala Ala Gly Val Val Val Gly Ser Gly Gly Gly Arg Gly 
20 25 30 

GCC GGG GGC AGC CGG CAG GGC GGG CAG CCC GGG GCG CAA AGG ATG TAC 380 
Ala Gly Gly Ser Arg Gln Gly Gly Gln Pro Gly Ala Gln Arg Met Tyr
35 40 45 

AAG CAG TCA ATG GCG CAG AGA GCG CGG ACC ATG GCA CTC TAC AAC CCC 428 
Lys Gln Ser Met Ala Gln Arg Ala Arg Thr Met Ala Leu Tyr Asn Pro
50 55 60

ATC CCC GTC CGA CAG AAC TGC CTC ACG GTT AAC CGG TCT CTC TTC CTC 476
Ile Pro Val Arg Gln Asn Cys Leu Thr Val Asn Arg Ser Leu Phe Leu
65 70 75 80 

TTC AGC GAA GAC AAC GTG GTG AGA AAA TAC GCC AAA AAG ATC ACC GAA 524 
Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys Lys Ile Thr Glu
85 90 95 

TGG CCT CCC TTT GAA TAT ATG ATT TTA GCC ACC ATC ATA GCG AAT TGC 572 
Trp Pro Pro Phe Glu Tyr Met Ile Leu Ala Thr Ile Ile Ala Asn Cys
100 105 110

ATC GTC CTC GCA CTG GAG CAG CAT CTG CCT GAT GAT GAC AAG ACC CCG 620 
Ile Val Leu Ala Leu Glu Gln His Leu Pro Asp Asp Asp Lys Thr Pro
115 120 125 

ATG TCT GAA CGG CTG GAT GAC ACA GAA CCA TAC TTC ATT GGA ATT TTT 668 
Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe Ile Gly Ile Phe
130 135 140

TGT TTC GAG GCT GGA ATT AAA ATC ATT GCC CTT GGG TTT GCC TTC CAC 716 
Cys Phe Glu Ala Gly Ile Lys Ile Ile Ala Leu Gly Phe Ala Phe His
145 150 155 160 

AAA GGC TCC TAC TTG AGG AAT GGC TGG AAT GTC ATG GAC TTT GTG GTG 764 
Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met Asp Phe Val Val 
165 170 175 

GTG CTA ACG GGC ATC TTG GCG ACA GTT GGG ACG GAG TTT GAC CTA CGG 812 
Val Leu Thr Gly Ile Leu Ala Thr Val Gly Thr Glu Phe Asp Leu Arg
180 185 190 

ACG CTG AGG GCA GTT CGA GTG CTG CGG CCG CTC AAG CTG GTG TCT GGA 860
Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys Leu Val Ser Gly
195 200 205 

ATC CCA AGT TTA GAA GTC GTC CTG AAG TCG ATC ATG AAG GCG ATG ATC 908 
Ile Pro Ser Leu Gln Val Val Leu Lys Ser Ile Met Lys Ala Met Ile
210 215 220 

CCT TTG CTG CAG ATC GGC CTC CTC CTA TTT TTT GCA ATC CTT ATT TTT 956 
Pro Leu Leu Gln Ile Gly Leu Leu Leu Phe Phe Ala Ile Leu Ile Phe
225 230 235 240 

GCA ATC ATA GGG TTA GAA TTT TAT ATG GGA AAA TTT CAT ACC ACC TGC 1004 
Ala Ile Ile Gly Leu Glu Phe Tyr Met Gly Lys Phe His Thr Thr Cys
245 250 255 

TTT GAA GAG GGG ACA GAT GAC ATT CAG GGT GAG TCT CCG GCT CCA TGT 1052 
Phe Glu Glu Gly Thr Asp Asp Ile Gln Gly Glu Ser Pro Ala Pro Cys
260 265 270 

GGG ACA GAA GAG CCC GCC CGC ACC TGC CCC AAT GGG ACC AAA TGT CAG 1100 
Gly Thr Glu Glu Pro Ala Arg Thr Cys Pro Asn Gly Thr Lys Cys Gln
275 280 285

CCC TAC TGG GAA GGG CCC AAC AAC GGG ATC ACT CAG TTC GAC AAC ATC 1148
Pro Tyr Trp Glu Gly Pro Asn Asn Gly Ile Thr Gln Phe Asp Asn Ile
290 295 300

FJu J£ a^ v T ? r CTG £ T GT ? TTC ^ TC C ATA ACC ATG GAA GGG TGG 1196
Leu Phe Ala Val Leu Thr Val Phe Gln Cys Ile Thr Met Glu Gly Trp

££• 1*1 t CTC r CTC TAC ** T AGC ™ C CAT GCC TCA GGG AAC ACT TGG AAC 1244
Thr Asp Leu Leu Tyr Asn Ser Asn Asp Ala Ser Gly Asn Thr Trp Asn
325 330 335

TGG TTG TAC TTC ATC CCC CTC ATC ATC ATC GGC TCC TTT TTT ATG OTV 1292
Trp Leu Tyr Phe Ile Pro Leu Ile Ile Ile Gly Ser Phe Phe Met Leu
340 345 350

AAC CTT GTG CTG GGT GTG CTG TCA GGG GAG TTT GCC AAA GAA AGG GAA 1340
Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys Glu Arg Glu

Sa vl? ^ 5*° CGG CGG GCT TTT CTG AAG CTG AGG CGG CAA CAA CAG 1388
Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg Gln Gln Gln
370 375 380

tTI ^ ° GT S AG CTC GGG TAG ATG GAA TGG ATC TCA AAA GCA GAA
Ile Glu Arg Glu Leu Asn Gly Tyr Met Glu Trp Ile Ser Lys Ala Glu
385 390 395 400

G?2 S2 T?° t C I C GCC GAG GAT GAA ACT GAC GGG GAG CAG AGG CAT CCC
Glu Val Ile Leu Ala Glu Asp Glu Thr Asp Gly Glu Gln Arg His Pro
405 410 415

GAT GGA GCT CTG CGG AGA ACC ACC ATA AAG AAA AGC AAG ACA GAT
Phe Asp Gly Ala Leu Arg Arg Thr Thr Ile Lys Lys Ser Lys Thr Asp
420 425 430

TTG CTC AAC CCC GAA GAG GCT GAG GAT CAG CTG GCT GAT ATA GCC TCT
Leu Leu Asn Pro Glu Glu Ala Glu Asp Gln Leu Ala Asp Ile Ala Ser
435 440 445

vlt SJ III o° C ^° G ?° CGA GCC AGC ATT m AGT GCC AAG CTG GAG
Val Gly Ser Pro Phe Ala Arg Ala Ser Ile Lys Ser Ala Lys Leu Glu
450 455 460

AAC TCG ACC TTT TTT CAC AAA AAG GAG AGG AGG ATG CGT TTC TAC ATC
Asn Ser Thr Phe Phe His Lys Lys Glu Arg Arg Met Arg Phe Tyr Ile
465 470 475 480

a™ £ GC m ACT GCC TTC TAC TGG ACT GTA CTC AGT TTG
Arg Arg Met Val Lys Thr Gln Ala Phe Tyr Trp Thr Val Leu Ser Leu
485 490 495

GTA GCT CTC AAC ACG CTG TGT GTT GCT ATT GTT CAC TAC AAC CAG CCC
Val Ala Leu Asn Thr Leu Cys Val Ala Ile Val His Tyr Asn Gln Pro
500 505 510

GAG TGG CTC TCC GAC TTC CTT TAC TAT GCA GAA TTC ATT TTC TTA GGA 1820
Glu Trp Leu Ser Asp Phe Leu Tyr Tyr Ala Glu Phe Ile Phe Leu Gly
515 520 525 

CTC TTT ATG TCC GAA ATG TTT ATA AAA ATG TAC GGG CTT GGG ACG CGG 1868 
Leu Phe Met Ser Glu Met Phe Ile Lys Met Tyr Gly Leu Gly Thr Arg
530 535 540 

CCT TAC TTC CAC TCT TCC TTC AAC TGC TTT GAC TGT GGG GTT ATC ATT 1916 
Pro Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Cys Gly Val Ile Ile
545 550 555 560 

GGG AGC ATC TTC GAG GTC ATC TGG GCT GTC ATA AAA CCT GGC ACA TCC 1964 
Gly Ser Ile Phe Glu Val Ile Trp Ala Val Ile Lys Pro Gly Thr Ser
565 570 575 

TTT GGA ATC AGC GTG TTA CGA GCC CTC AGG TTA TTG CGT ATT TTC AAA 2012 
Phe Gly Ile Ser Val Leu Arg Ala Leu Arg Leu Leu Arg Ile Phe Lys
580 585 590 

GTC ACA AAG TAC TGG GCA TCT CTC AGA AAC CTG GTC GTC TCT CTC CTC 2060
Val Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser Leu Leu
595 600 605 

AAC TCC ATG AAG TCC ATC ATC AGC CTG TTG TTT CTC CTT TTC CTG TTC 2108 
Asn Ser Met Lys Ser Ile Ile Ser Leu Leu Phe Leu Leu Phe Leu Phe
610 615 620 

ATT GTC GTC TTC GCC CTT TTG GGA ATG CAA CTC TTC GGC GGC CAG TTT 2156 
Ile Val Val Phe Ala Leu Leu Gly Met Gln Leu Phe Gly Gly Gln Phe
625 630 635 640 

AAT TTC GAT GAA GGG ACT CCT CCC ACC AAC TTC GAT ACT TTT CCA GCA 2204 
Asn Phe Asp Glu Gly Thr Pro Pro Thr Asn Phe Asp Thr Phe Pro Ala 
645 650 655 

GCA ATA ATG ACG GTG TTT CAG ATC CTG ACG GGC GAA GAC TGG AAC GAG 2252 
Ala Ile Met Thr Val Phe Gln Ile Leu Thr Gly Glu Asp Trp Asn Glu
660 665 670 

GTC ATG TAC GAC GGG ATC AAG TCT CAG GGG GGC GTG CAG GGC GGC ATG 2300
Val Met Tyr Asp Gly Ile Lys Ser Gln Gly Gly Val Gln Gly Gly Met
675 680 685 

GTG TTC TCC ATC TAT TTC ATT GTA CTG ACG CTC TTT GGG AAC TAC ACC 2348
Val Phe Ser Ile Tyr Phe Ile Val Leu Thr Leu Phe Gly Asn Tyr Thr
690 695 700

CTC CTG AAT GTG TTC TTG GCC ATC GCT GTG GAC AAT CTG GCC AAC GCC 2396 
Leu Leu Asn Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala Asn Ala
705 710 715 720 

CAG GAG CTC ACC AAG GTG GAG GCG GAC GAG CAA GAG GAA GAA GAA GCA 2444 
Gln Glu Leu Thr Lys Val Glu Ala Asp Glu Gln Glu Glu Glu Glu Ala
725 730 735

GCG AAC CAG AAA CTT GCC CTA CAG AAA GCC AAG GAG GTG GCA GAA GTG 2492
Ala Asn Gln Lys Leu Ala Leu Gln Lys Ala Lys Glu Val Ala Glu Val
740 745 750 

AGT CCT CTG TCC GCG GCC AAC ATG TCT ATA GCT GTG AAA GAG CAA CAG 2540 
Ser Pro Leu Ser Ala Ala Asn Met Ser Ile Ala Val Lys Glu Gln Gln
755 760 765 

AAG AAT CAA AAG CCA GCC AAG TCC GTG TGG GAG CAG CGG ACC AGT GAG 2588 
Lys Asn Gln Lys Pro Ala Lys Ser Val Trp Glu Gln Arg Thr Ser Glu
770 775 780 

ATG CGA AAG CAG AAC TTG CTG GCC AGC CGG GAG GCC CTG TAT AAC GAA 2636 
Met Arg Lys Gln Asn Leu Leu Ala Ser Arg Glu Ala Leu Tyr Asn Glu
785 790 795 800 

ATG GAC CCG GAC GAG CGC TGG AAG GCT GCC TAC ACG CGG CAC CTG CGG 2684
Met Asp Pro Asp Glu Arg Trp Lys Ala Ala Tyr Thr Arg His Leu Arg
805 810 815

CCA GAC ATG AAG ACG CAC TTG GAC CGG CCG CTG GTG GTG GAC CCG CAG 2732 
Pro Asp Met Lys Thr His Leu Asp Arg Pro Leu Val Val Asp Pro Gln
820 825 830 

GAG AAC CGC AAC AAC AAC ACC AAC AAG AGC CGG GCG GCC GAG CCC ACC 2780 
Glu Asn Arg Asn Asn Asn Thr Asn Lys Ser Arg Ala Ala Glu Pro Thr 
835 840 845

GTG GAC CAG CGC CTC GGC CAG CAG CGC GCC GAG GAC TTC CTC AGG AAA 2828 
Val Asp Gln Arg Leu Gly Gln Gln Arg Ala Glu Asp Phe Leu Arg Lys
850 855 860 

CAG GCC CGC TAC CAC GAT CGG GCC CGG GAC CCC AGC GGC TCG GCG GGC 2876
Gln Ala Arg Tyr His Asp Arg Ala Arg Asp Pro Ser Gly Ser Ala Gly
865 870 875 880 

CTG GAC GCA CGG AGG CCC TGG GCG GGA AGC CAG GAG GCC GAG CTG AGC 2924 
Leu Asp Ala Arg Arg Pro Trp Ala Gly Ser Gln Glu Ala Glu Leu Ser
885 890 895 

CGG GAG GGA CCC TAC GGC CGC GAG TCG GAC CAC CAC GCC CGG GAG GGC 2972 
Arg Glu Gly Pro Tyr Gly Arg Glu Ser Asp His His Ala Arg Glu Gly 
900 905 910 

AGC CTG GAG CAA CCC GGG TTC TGG GAG GGC GAG GCC GAG CGA GGC AAG 3020 
Ser Leu Glu Gln Pro Gly Phe Trp Glu Gly Glu Ala Glu Arg Gly Lys
915 920 925

GCC GGG GAC CCC CAC CGG AGG CAC GTG CAC CGG CAG GGG GGC AGC AGG 3068
Ala Gly Asp Pro His Arg Arg His Val His Arg Gln Gly Gly Ser Arg
930 935 940 

GAG AGC CGC AGC GGG TCC CCG CGC ACG GGC GCG GAC GGG GAG CAT CGA 3116 
Glu Ser Arg Ser Gly Ser Pro Arg Thr Gly Ala Asp Gly Glu His Arg 
945 950 955 960

CGT CAT CGC GCG CAC CGC AGG CCC GGG GAG GAG GGT CCG GAG GAC AAG 3164
Arg His Arg Ala His Arg Arg Pro Gly Glu Glu Gly Pro Glu Asp Lys 
965 970 975 

GCG GAG CGG AGG GCG CGG CAC CGC GAG GGC AGC CGG CCG GCC CGG GGC 3212 
Ala Glu Arg Arg Ala Arg His Arg Glu Gly Ser Arg Pro Ala Arg Gly 
980 985 990 

GGC GAG GGC GAG GGC GAG GGC CCC GAC GGG GGC GAG CGC AGG AGA AGG 3260
Gly Glu Gly Glu Gly Glu Gly Pro Asp Gly Gly Glu Arg Arg Arg Arg
995 1000 1005 

CAC CGG CAT GGC GCT CCA GCC ACG TAC GAG GGG GAC GCG CGG AGG GAG 3308 
His Arg His Gly Ala Pro Ala Thr Tyr Glu Gly Asp Ala Arg Arg Glu 
1010 1015 1020 

GAC AAG GAG CGG AGG CAT CGG AGG AGG AAA GAG AAC CAG GGC TCC GGG 3356 
Asp Lys Glu Arg Arg His Arg Arg Arg Lys Glu Asn Gln Gly Ser Gly
1025 1030 1035 1040 

GTC CCT GTG TCG GGC CCC AAC CTG TCA ACC ACC CGG CCA ATC CAG CAG 3404 
Val Pro Val Ser Gly Pro Asn Leu Ser Thr Thr Arg Pro Ile Gln Gln
1045 1050 1055 

GAC CTG GGC CGC CAA GAC CCA CCC CTG GCA GAG GAT ATT GAC AAC ATG 3452 
Asp Leu Gly Arg Gln Asp Pro Pro Leu Ala Glu Asp Ile Asp Asn Met
1060 1065 1070 

AAG AAC AAC AAG CTG GCC ACC GCG GAG TCG GCC GCT CCC CAC GGC AGC 3500
Lys Asn Asn Lys Leu Ala Thr Ala Glu Ser Ala Ala Pro His Gly Ser
1075 1080 1085 

CTT GGC CAC GCC GGC CTG CCC CAG AGC CCA GCC AAG ATG GGA AAC AGC 3548
Leu Gly His Ala Gly Leu Pro Gln Ser Pro Ala Lys Met Gly Asn Ser
1090 1095 1100 

ACC GAC CCC GGC CCC ATG CTG GCC ATC CCT GCC ATG GCC ACC AAC CCC 3596 
Thr Asp Pro Gly Pro Met Leu Ala Ile Pro Ala Met Ala Thr Asn Pro
1105 1110 1115 1120 

CAG AAC GCC GCC AGC CGC CGG ACG CCC AAC AAC CCG GGG AAC CCA TCC 3644 
Gln Asn Ala Ala Ser Arg Arg Thr Pro Asn Asn Pro Gly Asn Pro Ser
1125 1130 1135 

AAT CCC GGC CCC CCC AAG ACC CCC GAG AAT AGC CTT ATC GTC ACC AAC 3692 
Asn Pro Gly Pro Pro Lys Thr Pro Glu Asn Ser Leu Ile Val Thr Asn
1140 1145 1150 

CCC AGC GGC ACC CAG ACC AAT TCA GCT AAG ACT GCC AGG AAA CCC GAC 3740
Pro Ser Gly Thr Gln Thr Asn Ser Ala Lys Thr Ala Arg Lys Pro Asp
1155 1160 1165 

CAC ACC ACA GTG GAC ATC CCC CCA GCC TGC CCA CCC CCC CTC AAC CAC 3788 
His Thr Thr Val Asp Ile Pro Pro Ala Cys Pro Pro Pro Leu Asn His
1170 1175 1180

ACC GTC GTA CAA GTG AAC AAA AAC GCC AAC CCA GAC CCA CTG CCA AAA 3836
Thr Val Val Gln Val Asn Lys Asn Ala Asn Pro Asp Pro Leu Pro Lys
1185 1190 1195 1200

AAA GAG GAA GAG AAG AAG GAG GAG GAG GAA GAC GAC CGT GGG GAA GAC 3884
Lys Glu Glu Glu Lys Lys Glu Glu Glu Glu Asp Asp Arg Gly Glu Asp
1205 1210 1215

GGC CCT AAG CCA ATG CCT CCC TAT AGC TCC ATG TTC ATC CTG TCC ACG 3932
Gly Pro Lys Pro Met Pro Pro Tyr Ser Ser Met Phe Ile Leu Ser Thr
1220 1225 1230

ACC AAC CCC CTT CGC CGC CTG TGC CAT TAC ATC CTG AAC CTG CGC TAC 3980
Thr Asn Pro Leu Arg Arg Leu Cys His Tyr Ile Leu Asn Leu Arg Tyr
1235 1240 1245

III mI? l° C T?° T CTC ATG GTC ATT GCC ATG AGC AGC ATC GCC CTG 4028
?i^ Met Cys Ile Leu Met Val Ile Ala Met Ser Ser Ile Ala Leu
1250 1255 1260

GAC CCT GTG ^ CCC GCA CCT CGG AAC AAC GTG CTG 4076
Ala Ala Glu Asp Pro Val Gln Pro Asn Ala Pro Arg Asn Asn Val Leu
1265 1270 1275 1280

CGA TAC TTT GAC TAC GTT TTT ACA GGC GTC TTC ACC TTT GAG ATG GTG 4124 
Arg Tyr Phe Asp Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val 
1285 1290 1295 

ATC AAG ATG ATT GAC CTG GGG CTC GTC CTG CAT CAG GGT GCC TAC TTC 4172 
Ile Lys Met Ile Asp Leu Gly Leu Val Leu His Gln Gly Ala Tyr Phe
1300 1305 1310 

CGT GAC CTC TGG AAT ATT CTC GAC TTC ATA GTG GTC AGT GGG GCC CTG 4220 
Arg Asp Leu Trp Asn Ile Leu Asp Phe Ile Val Val Ser Gly Ala Leu 
1315 1320 1325 

?J A G ? C G ? C TTC ACT GGC AAT AGC AAA GGA AAA GAC ATC AAC ACG 4268 

Val Ala Phe Ala Phe Thr Gly Asn Ser Lys Gly Lys Asp Ile Asn Thr 
1330 1335 1340

ATT AAA TCC CTC CGA GTC CTC CGG GTG CTA CGA CCT CTT AAA ACC ATC 4316 
Ile Lys Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys Thr Ile
1345 1350 1355 1360 

AAG CGG CTG CCA AAG CTC AAG GCT GTG TTT GAC TGT GTG GTG AAC TCA 4364 
Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val Asn Ser 
1365 1370 1375 

CTT AAA AAC GTC TTC AAC ATC CTC ATC GTC TAC ATG CTA TTC ATG TTC 4412 
Leu Lys Asn Val Phe Asn Ile Leu Ile Val Tyr Met Leu Phe Met Phe
1380 1385 1390 

ATC TTC GCC GTG GTG GCT GTG CAG CTC TTC AAG GGG AAA TTC TTC CAC 4460
Ile Phe Ala Val Val Ala Val Gln Leu Phe Lys Gly Lys Phe Phe His
1395 1400 1405

TGC ACT GAC GAG TCC AAA GAG TTT GAG AAA GAT TGT CGA GGC AAA TAC 4508
Cys Thr Asp Glu Ser Lys Glu Phe Glu Lys Asp Cys Arg Gly Lys Tyr 
1410 1415 1420 

CTC CTC TAC GAG AAG AAT GAG GTG AAG GCG CGA GAC CGG GAG TGG AAG 4556 
Leu Leu Tyr Glu Lys Asn Glu Val Lys Ala Arg Asp Arg Glu Trp Lys 
1425 1430 1435 1440 

AAG TAT GAA TTC CAT TAC GAC AAT GTG CTG TGG GCT CTG CTG ACC CTC 4604 
Lys Tyr Glu Phe His Tyr Asp Asn Val Leu Trp Ala Leu Leu Thr Leu 
1445 1450 1455 

TTC ACC GTG TCC ACG GGA GAA GGC TGG CCA CAG GTC CTC AAG CAT TCG 4652
Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Gln Val Leu Lys His Ser
1460 1465 1470

GTG GAC GCC ACC TTT GAG AAC CAG GGC CCC AGC CCC GGG TAC CGC ATC 4700 
Val Asp Ala Thr Phe Glu Asn Gln Gly Pro Ser Pro Gly Tyr Arg Met
1475 1480 1485 

GAG ATG TCC ATT TTC TAC GTC GTC TAC TTT GTG GTG TTC CCC TTC TTC 4748 
Glu Met Ser Ile Phe Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe
1490 1495 1500 

TTT GTC AAT ATC TTT GTG GCC TTG ATC ATC ATC ACC TTC CAG GAG CAA 4796 
Phe Val Asn Ile Phe Val Ala Leu Ile Ile Ile Thr Phe Gln Glu Gln
1505 1510 1515 1520 

GGG GAC AAG ATG ATG GAG GAA TAC AGC CTG GAG AAA AAT GAG AGG GCC 4844 
Gly Asp Lys Met Met Glu Glu Tyr Ser Leu Glu Lys Asn Glu Arg Ala 
1525 1530 1535

TGC ATT GAT TTC GCC ATC AGC GCC AAG CCG CTG ACC CGA CAC ATG CCG 4892 
Cys Ile Asp Phe Ala Ile Ser Ala Lys Pro Leu Thr Arg His Met Pro
1540 1545 1550 

CAG AAC AAG CAG AGC TTC CAG TAC CGC ATG TGG CAG TTC GTG GTC TCT 4940 
Gln Asn Lys Gln Ser Phe Gln Tyr Arg Met Trp Gln Phe Val Val Ser
1555 1560 1565 

CCG CCT TTC GAG TAC ACG ATC ATG GCC ATG ATC GCC CTC AAC ACC ATC 4988 
Pro Pro Phe Glu Tyr Thr Ile Met Ala Met Ile Ala Leu Asn Thr Ile
1570 1575 1580

GTG CTT ATG ATG AAG TTC TAT GGG GCT TCT GTT GCT TAT GAA AAT GCC 5036 
Val Leu Met Met Lys Phe Tyr Gly Ala Ser Val Ala Tyr Glu Asn Ala 
1585 1590 1595 1600 

CTG CGG GTG TTC AAC ATC GTC TTC ACC TCC CTC TTC TCT CTG GAA TGT 5084 
Leu Arg Val Phe Asn Ile Val Phe Thr Ser Leu Phe Ser Leu Glu Cys
1605 1610 1615

GTG CTC AAA GTC ATG GCT TTT GGG ATT CTG AAT TAT TTC CGC GAT GCC 5132 
Val Leu Lys Val Met Ala Phe Gly Ile Leu Asn Tyr Phe Arg Asp Ala
1620 1625 1630

2 2S EE 2 2 2 S5 S IS £ K 2 S 5 "°



2 22 2 SI g 2g J» 2 S « « „ j« s 

1655 1660 

2S £g S S S ^° S°° CTC ATC m CTT CTC CGT <* G «T 
1665 9 6 *** ^? n Ala Leu Ile L V S Leu Arg Gin Gly 

1670 1675 



1680 



2 S 2 2 52 2 2 2 ES 2 2 2 22 
2 2 2 2 2 2 2 2 222 2 2 2 22 
2 2 22 2 2 2 22 2 2 2 222 2 
2 22 2 2 2 22 2 2 2 2 22 2 2 



1740 



22 2 22 2 2 S 2 2 2 2 2 2 2 2 

1750 1755 17 £ 0 

Si £1 S 2s ™ ill iZ S I cc i GC GTC * GC GGG *** ggg tgt 

^ S ^ n c Ile Met Leu Ser c y s Leu ser Gly Lys Pro Cys 
1765 "70 1775 * 



2 2 2 22 2 2 2 22 2 2 2 2 22 - 

1785 1790 



5276 



5324 



5372 



5420 



5468 



5516 



5564 



2 2 22 2 2 2 2 2 2 2 2 2 2 2 2 - 

1800 1805 

2 22 2 2 2 2 2 2 2 2 2 2 2 2 2 5708

1815 1820

GAC TCC TCC ATC CTG GGC CCC CAC CAC CTG GAT GAG TAC GTG CGT GTP 5756
Asp Ser Ser Ile Leu Gly Pro His His Leu Asp Glu Tyr Val Arg Val
1825 1830 1835 1840

I™ Sff ^ G I AT GAC CCC GCA GCT TGG GG ^ CGC ATG CCT TAC CTG GAC 5804
Trp Ala Glu Tyr Asp Pro Ala Ala Trp Gly Arg Met Pro Tyr Leu Asp
1845 1850 1855

ATG TAT CAG ATG CTG AGA CAC ATG TCT CCG CCC CTG GGT CTG GGG AAG 5852
Met Tyr Gln Met Leu Arg His Met Ser Pro Pro Leu Gly Leu Gly Lys
1860 1865 1870 

AAG TGT CCG GCC AGA GTG GCT TAC AAG CGG CTT CTG CGG ATG GAC CTG 5900 
Lys Cys Pro Ala Arg Val Ala Tyr Lys Arg Leu Leu Arg Met Asp Leu 
1875 1880 1885

CCC GTC GCA GAT GAC AAC ACC GTC CAC TTC AAT TCC ACC CTC ATG GCT 5948 
Pro Val Ala Asp Asp Asn Thr Val His Phe Asn Ser Thr Leu Met Ala 
1890 1895 1900 

CTG ATC CGC ACA GCC CTG GAC ATC AAG ATT GCC AAG GGA GGA GCC GAC 5996 
Leu Ile Arg Thr Ala Leu Asp Ile Lys Ile Ala Lys Gly Gly Ala Asp
1905 1910 1915 1920 

AAA CAG CAG ATG GAC GCT GAG CTG CGG AAG GAG ATG ATG GCG ATT TGG 6044 
Lys Gln Gln Met Asp Ala Glu Leu Arg Lys Glu Met Met Ala Ile Trp
1925 1930 1935 

CCC AAT CTG TCC CAG AAG ACG CTA GAC CTG CTG GTC ACA CCT CAC AAG 6092 
Pro Asn Leu Ser Gln Lys Thr Leu Asp Leu Leu Val Thr Pro His Lys
1940 1945 1950 

TCC ACG GAC CTC ACC GTG GGG AAG ATC TAC GCA GCC ATG ATG ATC ATG 6140
Ser Thr Asp Leu Thr Val Gly Lys Ile Tyr Ala Ala Met Met Ile Met
1955 1960 1965

GAG TAC TAC CGG CAG AGC AAG GCC AAG AAG CTG CAG GCC ATG CGC GAG 6188 
Glu Tyr Tyr Arg Gln Ser Lys Ala Lys Lys Leu Gln Ala Met Arg Glu
1970 1975 1980

GAG CAG GAC CGG ACA CCC CTC ATG TTC CAG CGC ATG GAG CCC CCG TCC 6236
Glu Gln Asp Arg Thr Pro Leu Met Phe Gln Arg Met Glu Pro Pro Ser
1985 1990 1995 2000

CCA ACG CAG GAA GGG GGA CCT GGC CAG AAC GCC CTC CCC TCC ACC CAG 6284 
Pro Thr Gln Glu Gly Gly Pro Gly Gln Asn Ala Leu Pro Ser Thr Gln
2005 2010 2015 

CTG GAC CCA GGA GGA GCC CTG ATG GCT CAC GAA AGC GGC CTC AAG GAG 6332
Leu Asp Pro Gly Gly Ala Leu Met Ala His Glu Ser Gly Leu Lys Glu
2020 2025 2030 

AGC CCG TCC TGG GTG ACC CAG CGT GCC CAG GAG ATG TTC CAG AAG ACG 6380
Ser Pro Ser Trp Val Thr Gln Arg Ala Gln Glu Met Phe Gln Lys Thr
2035 2040 2045 

GGC ACA TGG AGT CCG GAA CAA GGC CCC CCT ACC GAC ATG CCC AAC AGC 6428 
Gly Thr Trp Ser Pro Glu Gln Gly Pro Pro Thr Asp Met Pro Asn Ser
2050 2055 2060 

CAG CCT AAC TCT CAG TCC GTG GAG ATG CGA GAG ATG GGC AGA GAT GGC 6476
Gln Pro Asn Ser Gln Ser Val Glu Met Arg Glu Met Gly Arg Asp Gly
2065 2070 2075 2080

TAC TCC GAC AGC GAG CAC TAC CTC CCC ATG GAA GGC CAG GGC CGG GCT 6524
Tyr Ser Asp Ser Glu His Tyr Leu Pro Met Glu Gly Gln Gly Arg Ala
2085 2090 2095

GCC TCC ATG CCC CGC CTC CCT GCA GAG AAC CAG AGG AGA AGG GGC CGG 6572
Ala Ser Met Pro Arg Leu Pro Ala Glu Asn Gln Arg Arg Arg Gly Arg
2100 2105 2110

CCA CGT GGG AAT AAC CTC AGT ACC ATC TCA GAC ACC AGC CCC ATG AAG 6620
Pro Arg Gly Asn Asn Leu Ser Thr Ile Ser Asp Thr Ser Pro Met Lys
2115 2120 2125

CGT TCA GCC TCC GTG CTG GGC CCC AAG GCC CGA CGC CTG GAC GAT TAC 6668
Arg Ser Ala Ser Val Leu Gly Pro Lys Ala Arg Arg Leu Asp Asp Tyr
2130 2135 2140

TCG CTG GAG CGG GTC CCG CCC GAG GAG AAC CAG CGG CAC CAC CAG CGC 6716
Ser Leu Glu Arg Val Pro Pro Glu Glu Asn Gln Arg His His Gln Arg
2145 2150 2155 2160

CGC CGC GAC CGC AGC CAC CGC GCC TCT GAG CGC TCC CTG GGC CGC TAC 6764
Arg Arg Asp Arg Ser His Arg Ala Ser Glu Arg Ser Leu Gly Arg Tyr
2165 2170 2175

ACC GAT GTG GAC ACA GGC TTG GGG ACA GAC CTG AGC ATG ACC ACC CAA 6812
Thr Asp Val Asp Thr Gly Leu Gly Thr Asp Leu Ser Met Thr Thr Gln
2180 2185 2190

TCC GGG GAC CTG CCG TCG AAG GAG CGG GAC CAG GAG CGG GGC CGG CCC 6860
Ser Gly Asp Leu Pro Ser Lys Glu Arg Asp Gln Glu Arg Gly Arg Pro
2195 2200 2205

AAG GAT CGG AAG CAT CGA CAG CAC CAC CAC CAC CAC CAC CAC CAC CAC 6908
Lys Asp Arg Lys His Arg Gln His His His His His His His His His
2210 2215 2220

CAT CCC CCG CCC CCC GAC AAG GAC CGC TAT GCC CAG GAA CGG CCG GAC 6956
His Pro Pro Pro Pro Asp Lys Asp Arg Tyr Ala Gln Glu Arg Pro Asp
2225 2230 2235 2240

CAC GGC CGG GCA CGG GCT CGG GAC CAG CGC TGG TCC CGC TCG CCC AGC 7004
His Gly Arg Ala Arg Ala Arg Asp Gln Arg Trp Ser Arg Ser Pro Ser
2245 2250 2255

GAG GGC CGA GAG CAC ATG GCG CAC CGG CAG TAGTTCCGTA AGTGGAAGCC 7054
Glu Gly Arg Glu His Met Ala His Arg Gln
2260 2265

CAGCCCCCTC AACATCTGGT ACCAGCACTC CGCGGCGGGG CCGCCGCCAG CTCCCCCAGA 7114

CCCCCTCCAC CCCCCGGCCA CACGTGTCCT ATTCCCCTGT GATCCGTAAG GCCGGCGGCT 7174

CGGGGCCCCC GCAGCAGCAG CAGCAGCAGC AGGCGGTGGC CAGGCCGGGC CGGGCGGCCA 7234

CCAGCGGCCC TCGGAGGTAC CCAGGCCCCA CGGCCGAGCC TCTGGCCGGA GATCGGCCGC 7294

CCACGGGGGG CCACAGCAGC GGCCGCTCGC CCAGGATGGA GAGGCGGGTC CCAGGCCCGG 7354

CCCGGAGCGA GTCCCCCAGG GCCTGTCGAC ACGGCGGGGC CCGGTGGCCG GCATCTGGCC 7414 

CGCACGTGTC CGAGGGGCCC CCGGGTCCCC GGCACCATGG CTACTACCGG GGCTCCGACT 7474 

ACGACGAGGC CGATGGCCCG GGCAGCGGGG GCGGCGAGGA GGCCATGGCC GGGGCCTACG 7534

ACGCGCCACC CCCCGTACGA CACGCGTCCT CGGGCGCCAC CGGGCGCTCG CCCAGGACTC 7594 

CCCGGGCCTC GGGCCCGGCC TGCGCCTCGC CTTCTCGGCA CGGCCGGCGA CTCCCCAACG 7654 

GCTACTACCC GGCGCACGGA CTGGCCAGGC CCCGCGGGCC GGGCTCCAGG AAGGGCCTGC 7714 

ACGAACCCTA CAGCGAGAGT GACGATGATT GGTGCTAAGC CCGGGCGAGG TGGCGCCCGC 7774

CCGGCCCCCC ACGCACC 7791 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 166..6921

(D) OTHER INFORMATION: /standard_name= "Alpha-1E-1"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

GCTGCTGCTG CCTCTCCGAA GAGCTCGCGG AGCTCCCCAG AGGCGGTGGT CCCCGTGCTT 60

GTCTGGATGC GGCTCTGAGT CTCCGTGTGT CTTTCTGCTT GTTGCTGTGT GCGGGTGTTC 120

GGCCGCGATC ACCTTTGTGT GTCTTCTGTC TGTTTAAACC TCAGG ATG GCT CGC 174
Met Ala Arg
1




222 




270 




318

TTG TAC AAC CCC ATT CCC GTC CGG CAG AAC TGT TTC ACC GTC AAC AGA 366
Leu Tyr Asn Pro Ile Pro Val Arg Gln Asn Cys Phe Thr Val Asn Arg
55 60 65 

TCC CTG TTC ATC TTC GGA GAA GAT AAC ATT GTC AGG AAA TAT GCC AAG 414 
Ser Leu Phe Ile Phe Gly Glu Asp Asn Ile Val Arg Lys Tyr Ala Lys
70 75 80

AAG CTC ATC GAT TGG CCG CCA TTT GAG TAC ATG ATC CTG GCC ACC ATC 462 
Lys Leu Ile Asp Trp Pro Pro Phe Glu Tyr Met Ile Leu Ala Thr Ile
85 90 95

ATT GCC AAC TGC ATC GTC CTG GCC CTG GAG CAG CAT CTT CCT GAG GAT 510 
Ile Ala Asn Cys Ile Val Leu Ala Leu Glu Gln His Leu Pro Glu Asp
100 105 110 115

GAC AAG ACC CCC ATG TCC CGA AGA CTG GAG AAG ACA GAA CCT TAT TTC 558
Asp Lys Thr Pro Met Ser Arg Arg Leu Glu Lys Thr Glu Pro Tyr Phe
120 125 130 

ATT GGG ATC TTT TGC TTT GAA GCT GGG ATC AAA ATT GTG GCC CTG GGG 606 
Ile Gly Ile Phe Cys Phe Glu Ala Gly Ile Lys Ile Val Ala Leu Gly
135 140 145

TTC ATC TTC CAT AAG GGC TCT TAC CTC CGC AAT GGC TGG AAT GTC ATG 654 
Phe Ile Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met
150 155 160 

GAC TTC ATC GTG GTC CTC AGT GGC ATC CTG GCC ACT GCA GGA ACC CAC 702
Asp Phe Ile Val Val Leu Ser Gly Ile Leu Ala Thr Ala Gly Thr His
165 170 175 

TTC AAT ACT CAC GTG GAC CTG AGG ACC CTC CGG GCT GTG CGT GTC CTG 750 
Phe Asn Thr His Val Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu 
180 185 190 195

CGG CCT TTG AAG CTC GTG TCA GGG ATA CCT AGC CTG CAG ATT GTG TTG 798 
Arg Pro Leu Lys Leu Val Ser Gly Ile Pro Ser Leu Gln Ile Val Leu
200 205 210 

AAG TCC ATC ATG AAG GCC ATG GTA CCT CTT CTG CAG ATT GGC CTT CTG 846
Lys Ser Ile Met Lys Ala Met Val Pro Leu Leu Gln Ile Gly Leu Leu
215 220 225 

CTC TTC TTT GCC ATC CTG ATG TTT GCT ATC ATT GGT TTG GAG TTC TAC 894 
Leu Phe Phe Ala Ile Leu Met Phe Ala Ile Ile Gly Leu Glu Phe Tyr
230 235 240 

AGT GGC AAG TTA CAT CGA GCG TGC TTC ATG AAC AAT TCA GGT ATT CTA 942 
Ser Gly Lys Leu His Arg Ala Cys Phe Met Asn Asn Ser Gly Ile Leu
245 250 255 

GAA GGA TTT GAC CCC CCT CAC CCA TGT GGT GTG CAG GGC TGC CCA GCT 990 
Glu Gly Phe Asp Pro Pro His Pro Cys Gly Val Gln Gly Cys Pro Ala
260 265 270 275

GGT TAT GAA TGC AAG GAC TGG ATC GGC CCC AAT GAT GGG ATC ACC CAG 1038
Gly Tyr Glu Cys Lys Asp Trp Ile Gly Pro Asn Asp Gly Ile Thr Gln
280 285 290 

TTT GAT AAC ATC CTT TTT GCT GTG CTG ACT GTC TTC CAG TGC ATC ACC 1086 
Phe Asp Asn Ile Leu Phe Ala Val Leu Thr Val Phe Gln Cys Ile Thr
295 300 305 

ATG GAA GGG TGG ACC ACT GTG CTG TAC AAT ACC AAT GAT GCC TTA GGA 1134 
Met Glu Gly Trp Thr Thr Val Leu Tyr Asn Thr Asn Asp Ala Leu Gly 
310 315 320 

GCC ACC TGG AAT TGG CTG TAC TTC ATC CCC CTC ATC ATC ATT GGA TCC 1182 
Ala Thr Trp Asn Trp Leu Tyr Phe Ile Pro Leu Ile Ile Ile Gly Ser
325 330 335 

TTC TTT GTT CTC AAC CTA GTC CTG GGA GTG CTT TCC GGG GAA TTT GCC 1230 
Phe Phe Val Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala 
340 345 350 355 

AAA GAG AGA GAG AGA GTG GAG AAC CGA AGG GCT TTC ATG AAG CTG CGG 1278 
Lys Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Met Lys Leu Arg 
360 365 370 

CGC CAG CAG CAG ATT GAG CGT GAG CTG AAT GGC TAC CGT GCC TGG ATA 1326 
Arg Gln Gln Gln Ile Glu Arg Glu Leu Asn Gly Tyr Arg Ala Trp Ile
375 380 385 

GAC AAA GCA GAG GAA GTC ATG CTC GCT GAA GAA AAT AAA AAT GCT GGA 1374 
Asp Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asn Lys Asn Ala Gly 
390 395 400 

ACA TCC GCC TTA GAA GTG CTT CGA AGG GCA ACC ATC AAG AGG AGC CGG 1422 
Thr Ser Ala Leu Glu Val Leu Arg Arg Ala Thr Ile Lys Arg Ser Arg
405 410 415 

ACA GAG GCC ATG ACT CGA GAC TCC AGT GAT GAG CAC TGT GTT GAT ATC 1470 
Thr Glu Ala Met Thr Arg Asp Ser Ser Asp Glu His Cys Val Asp Ile
420 425 430 435 

TCC TCT GTG GGC ACA CCT CTG GCC CGA GCC AGT ATC AAA AGT GCA AAG 1518 
Ser Ser Val Gly Thr Pro Leu Ala Arg Ala Ser Ile Lys Ser Ala Lys
440 445 450 

GTA GAC GGG GTC TCT TAT TTC CGG CAC AAG GAA AGG CTT CTG CGC ATC 1566 
Val Asp Gly Val Ser Tyr Phe Arg His Lys Glu Arg Leu Leu Arg Ile
455 460 465 

TCC ATT CGC CAC ATG GTT AAA TCC CAG GTG TTT TAC TGG ATT GTG CTG 1614 
Ser Ile Arg His Met Val Lys Ser Gln Val Phe Tyr Trp Ile Val Leu
470 475 480 

AGC CTT GTG GCA CTC AAC ACT GCC TGT GTG GCC ATT GTC CAT CAC AAC 1662
Ser Leu Val Ala Leu Asn Thr Ala Cys Val Ala Ile Val His His Asn
485 490 495

CAG CCC CAG TGG CTC ACC CAC CTC CTC TAC TAT GCA GAA TTT CTG TTT
Gln Pro Gln Trp Leu Thr His Leu Leu Tyr Tyr Ala Glu Phe Leu Phe
500 505 510 515

CTG GGA CTC TTC CTC TTG GAG ATG TCC CTG AAG ATG TAT GGC ATG GGG
Leu Gly Leu Phe Leu Leu Glu Met Ser Leu Lys Met Tyr Gly Met Gly
520 525 530

CCT CGC CTT TAT TTT CAC TCT TCA TTC AAC TGC TTT GAT TTT GGG GTC
Pro Arg Leu Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Phe Gly Val
535 540 545

Thr vJ? r?5 c GT ir G ^ GTG GTC TGG GCA ATC TTC AGA CCT GGT
Thr Val Gly Ser Ile Phe Glu Val Val Trp Ala Ile Phe Arg Pro Gly
550 555 560

ACG TCT TTT GGA ATC AGT GTC TTG CGA GCC CTC CGG CTT CTA AGA ATA
Thr Ser Phe Gly Ile Ser Val Leu Arg Ala Leu Arg Leu Leu Arg Ile
565 570 575

III t? A ™° ^ G I AT TGG GCT TCC CTA CGG ^ TTG GTG GTC TCC
Phe Lys Ile Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser
580 585 590 595

TTG ATG AGC TCA ATG AAG TCT ATC ATC AGT TTG CTT TTC CTC CTC TTC
Leu Met Ser Ser Met Lys Ser Ile Ile Ser Leu Leu Phe Leu Leu Phe
600 605 610

LeS Se llf v!? vl? ll T T CTC CTA GGA ATG ^ TTA GGA GGC
Leu Phe Ile Val Val Phe Ala Leu Leu Gly Met Gln Leu Phe Gly Gly
615 620 625

AGG TTT AAC TTT AAT GAT GGG ACT CCT TCG GCA AAT TTT GAT ACC TTC
Arg Phe Asn Phe Asn Asp Gly Thr Pro Ser Ala Asn Phe Asp Thr Phe
630 635 640

CCT GCA GCC ATC ATG ACT GTG TTC CAG ATC CTG ACG GGT GAG GAC TGG
Pro Ala Ala Ile Met Thr Val Phe Gln Ile Leu Thr Gly Glu Asp Trp
645 650 655

AAT GAG GTG ATG TAC AAT GGG ATC CGC TCC CAG GGT GGG GTC AGC TCA
Asn Glu Val Met Tyr Asn Gly Ile Arg Ser Gln Gly Gly Val Ser Ser
660 665 670 675

AT ? I GG TCT GCC ATC TAC TTC ATT GTG CTC ACC TTG TTT GGC AAC
Gly Met Trp Ser Ala Ile Tyr Phe Ile Val Leu Thr Leu Phe Gly Asn
680 685 690

TAC ACQ CTA CTG AAT GTG TTC TTG GCT ATC GCT GTG GAT AAT CTC GCC
Tyr Thr Leu Leu Asn Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala
695 700 705

^ G GAA CTG ACC AAG GAT GAA CAG GAG GAA GAA GAG GCC TTC
Asn Ala Gln Glu Leu Thr Lys Asp Glu Gln Glu Glu Glu Glu Ala Phe
710 715 720

AAC CAG AAA CAT GCA CTG CAG AAG GCC AAG GAG GTC AGC CCG ATG TCT 2382
Asn Gln Lys His Ala Leu Gln Lys Ala Lys Glu Val Ser Pro Met Ser
725 730 735 

GCA CCC AAC ATG CCT TCG ATC GAG AGG GAG CGG AGG CGC CGG CAC CAC 2430
Ala Pro Asn Met Pro Ser Ile Glu Arg Glu Arg Arg Arg Arg His His
740 745 750 755 

ATG TCC GTG TGG GAG CAG CGT ACC AGC CAG CTG AGG AAG CAC ATG CAG 2478 
Met Ser Val Trp Glu Gln Arg Thr Ser Gln Leu Arg Lys His Met Gln
760 765 770 

ATG TCC AGC CAG GAG GCC CTC AAC AGA GAG GAG GCG CCG ACC ATG AAC 2526 
Met Ser Ser Gln Glu Ala Leu Asn Arg Glu Glu Ala Pro Thr Met Asn
775 780 785 

CCG CTC AAC CCC CTC AAC CCG CTC AGC TCC CTC AAC CCG CTC AAT GCC 2574 
Pro Leu Asn Pro Leu Asn Pro Leu Ser Ser Leu Asn Pro Leu Asn Ala 
790 795 800 

CAC CCC AGC CTT TAT CGG CGA CCC AGG GCC ATT GAG GGC CTG GCC CTG 2622 
His Pro Ser Leu Tyr Arg Arg Pro Arg Ala Ile Glu Gly Leu Ala Leu
805 810 815

GGC CTG GCC CTG GAG AAG TTC GAG GAG GAG CGC ATC AGC CGT GGG GGG 2670 
Gly Leu Ala Leu Glu Lys Phe Glu Glu Glu Arg Ile Ser Arg Gly Gly
820 825 830 835 

TCC CTC AAG GGG GAT GGA GGG GAC CGA TCC AGT GCC CTG GAC AAC CAG 2718 
Ser Leu Lys Gly Asp Gly Gly Asp Arg Ser Ser Ala Leu Asp Asn Gln
840 845 850 

AGG ACC CCT TTG TCC CTG GGC CAG CGG GAG CCA CCA TGG CTG GCC AGG 2766 
Arg Thr Pro Leu Ser Leu Gly Gln Arg Glu Pro Pro Trp Leu Ala Arg
855 860 865 

CCC TGT CAT GGA AAC TGT GAC CCG ACT CAG CAG GAG GCA GGG GGA GGA 2814
Pro Cys His Gly Asn Cys Asp Pro Thr Gln Gln Glu Ala Gly Gly Gly
870 875 880 

GAG GCT GTG GTG ACC TTT GAG GAC CGG GCC AGG CAC AGG CAG AGC CAA 2862 
Glu Ala Val Val Thr Phe Glu Asp Arg Ala Arg His Arg Gln Ser Gln
885 890 895 

CGG CGC AGC CGG CAT CGC CGC GTC AGG ACA GAA GGC AAG GAG TCC TCT 2910 
Arg Arg Ser Arg His Arg Arg Val Arg Thr Glu Gly Lys Glu Ser Ser 
900 905 910 915

TCA GCC TCC CGG AGC AGG TCT GCC AGC CAG GAA CGC AGT CTG GAT GAA 2958 
Ser Ala Ser Arg Ser Arg Ser Ala Ser Gln Glu Arg Ser Leu Asp Glu
920 925 930 

GCC ATG CCC ACT GAA GGG GAG AAG GAC CAT GAG CTC AGG GGC AAC CAT 3006
Ala Met Pro Thr Glu Gly Glu Lys Asp His Glu Leu Arg Gly Asn His 
935 940 945

GGT GCC AAG GAG CCA ACG ATC CAA GAA GAG AGA GCC CAG GAT TTA AGG
Gly Ala Lys Glu Pro Thr Ile Gln Glu Glu Arg Ala Gln Asp Leu Arg
950 955 960

AGG ACC AAC AGT CTG ATG GTG TCC AGA GGC TCC GGG CTG GCA GGA GGC 
Arg Thr Asn Ser Leu Met Val Ser Arg Gly Ser Gly Leu Ala Gly Gly
965 970 975 

CTT GAT GAG GCT GAC ACC CCC CTA GTC CTG CCC CAT CCT GAG CTG GAA 
Leu Asp Glu Ala Asp Thr Pro Leu Val Leu Pro His Pro Glu Leu Glu 
980 985 990

GTG GGG AAG CAC GTG GTG CTG ACG GAG CAG GAG CCA GAA GGC AGC AGT 
Val Gly Lys His Val Val Leu Thr Glu Gln Glu Pro Glu Gly Ser Ser
1000 1005 1010 

GAG CAG GCC CTG CTG GGG AAT GTG CAG CTA GAC ATG GGC CGG GTC ATC 
Glu Gln Ala Leu Leu Gly Asn Val Gln Leu Asp Met Gly Arg Val Ile
1015 1020 1025 

AGC CAG AGC GAG CCT GAC CTC TCC TGC ATC ACG GCC AAC ACG GAC AAG
Ser Gln Ser Glu Pro Asp Leu Ser Cys Ile Thr Ala Asn Thr Asp Lys
1030 1035 1040

GCC ACC ACC GAG AGC ACC AGC GTC ACC GTC GCC ATC CCC GAC GTG GAC 
Ala Thr Thr Glu Ser Thr Ser Val Thr Val Ala Ile Pro Asp Val Asp
1045 1050 1055

CCC TTG GTG GAC TCA ACC GTG GTG CAC ATT AGC AAC AAG ACG GAT GGG 
Pro Leu Val Asp Ser Thr Val Val His Ile Ser Asn Lys Thr Asp Gly
1060 1065 1070 1075 

GAA GCC AGT CCC TTG AAG GAG GCA GAG ATC AGA GAG GAT GAG GAG GAG 
Glu Ala Ser Pro Leu Lys Glu Ala Glu Ile Arg Glu Asp Glu Glu Glu
1080 1085 1090

GTG GAG AAG AAG AAG CAG AAG AAG GAG AAG CGT GAG ACA GGC AAA GCC 
Val Glu Lys Lys Lys Gln Lys Lys Glu Lys Arg Glu Thr Gly Lys Ala
1095 1100 1105

ATG GTG CCC CAC AGC TCA ATG TTC ATC TTC AGC ACC ACC AAC CCG ATC 
Met Val Pro His Ser Ser Met Phe Ile Phe Ser Thr Thr Asn Pro Ile
1110 1115 1120

CGG AGG GCC TGC CAC TAC ATC GTG AAC CTG CGC TAC TTT GAG ATG TGC 
Arg Arg Ala Cys His Tyr Ile Val Asn Leu Arg Tyr Phe Glu Met Cys
1125 1130 1135

ATC CTC CTG GTG ATT GCA GCC AGC AGC ATC GCC CTG GCG GCA GAG GAC 
Ile Leu Leu Val Ile Ala Ala Ser Ser Ile Ala Leu Ala Ala Glu Asp
1140 1145 1150 1155

CCC GTC CTG ACC AAC TCG GAG CGC AAC AAA GTC CTG AGG TAT TTT GAC 
Pro Val Leu Thr Asn Ser Glu Arg Asn Lys Val Leu Arg Tyr Phe Asp 
1160 1165 1170

TAT GTG TTC ACG GGC GTG TTC ACC TTT GAG ATG GTT ATA AAG ATG ATA 3726
Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val Ile Lys Met Ile
1175 1180 1185 

GAC CAA GGC TTG ATC CTG CAG GAT GGG TCC TAC TTC CGA GAC TTG TGG 3774 
Asp Gln Gly Leu Ile Leu Gln Asp Gly Ser Tyr Phe Arg Asp Leu Trp
1190 1195 1200 

AAC ATC CTG GAC TTT GTG GTG GTC GTT GGC GCA TTG GTG GCC TTT GCT 3822
Asn Ile Leu Asp Phe Val Val Val Val Gly Ala Leu Val Ala Phe Ala
1205 1210 1215

CTG GCG AAC GCT TTG GGA ACC AAC AAA GGA CGG GAC ATC AAG ACC ATC 3870
Leu Ala Asn Ala Leu Gly Thr Asn Lys Gly Arg Asp Ile Lys Thr Ile
1220 1225 1230 1235 

AAG TCT CTG CGG GTG CTC CGA GTT CTA AGG CCA CTG AAA ACC ATC AAG 3918
Lys Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys Thr Ile Lys
1240 1245 1250 

CGC TTG CCC AAG CTC AAG GCC GTC TTC GAC TGC GTA GTG ACC TCC TTG 3966 
Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val Thr Ser Leu 
1255 1260 1265 

AAG AAT GTC TTC AAC ATA CTC ATT GTG TAC AAG CTC TTC ATG TTC ATC 4014
Lys Asn Val Phe Asn Ile Leu Ile Val Tyr Lys Leu Phe Met Phe Ile
1270 1275 1280 

TTT GCT GTC ATC GCA GTT CAG CTC TTC AAG GGA AAG TTC TTT TAT TGC 4062
Phe Ala Val Ile Ala Val Gln Leu Phe Lys Gly Lys Phe Phe Tyr Cys
1285 1290 1295 

ACG GAC AGT TCC AAG GAC ACA GAG AAG GAG TGC ATA GGC AAC TAT GTA 4110 
Thr Asp Ser Ser Lys Asp Thr Glu Lys Glu Cys Ile Gly Asn Tyr Val
1300 1305 1310 1315 

GAT CAC GAG AAA AAC AAG ATG GAG GTG AAG GGC CGG GAA TGG AAG CGC 4158 
Asp His Glu Lys Asn Lys Met Glu Val Lys Gly Arg Glu Trp Lys Arg 
1320 1325 1330 

CAT GAA TTC CAC TAC GAC AAC ATT ATC TGG GCC CTG CTG ACC CTC TTC 4206 
His Glu Phe His Tyr Asp Asn Ile Ile Trp Ala Leu Leu Thr Leu Phe
1335 1340 1345 

ACC GTC TCC ACA GGG GAA GGA TGG CCT CAA GTT CTG CAG CAC TCT GTA 4254 
Thr Val Ser Thr Gly Glu Gly Trp Pro Gln Val Leu Gln His Ser Val
1350 1355 1360

GAT GTG ACA GAG GAA GAC CGA GGC CCA AGC CGC AGC AAC CGC ATG GAG 4302 
Asp Val Thr Glu Glu Asp Arg Gly Pro Ser Arg Ser Asn Arg Met Glu 
1365 1370 1375 

ATG TCT ATC TTT TAT GTA GTC TAC TTT GTG GTC TTC CCC TTC TTC TTT 4350 
Met Ser Ile Phe Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe Phe
1380 1385 1390 1395

GTC AAT ATC TTT GTG GCT CTC ATC ATC ATC ACC TTC CAG GAG CAA GGG
Val Asn Ile Phe Val Ala Leu Ile Ile Ile Thr Phe Gln Glu Gln Gly
1400 1405 1410 

GAT AAG ATG ATG GAG GAG TGC AGC CTG GAG AAG AAT GAG AGG GCG TGC 
Asp Lys Met Met Glu Glu Cys Ser Leu Glu Lys Asn Glu Arg Ala Cys 
1415 1420 1425 

ATC GAC TTC GCC ATC AGC GCC AAA CCT CTC ACC CGC TAC ATG CCG CAG 
Ile Asp Phe Ala Ile Ser Ala Lys Pro Leu Thr Arg Tyr Met Pro Gln
1430 1435 1440

AAC AGA CAC ACC TTC CAG TAC CGC GTG TGG CAC TTT GTG GTG TCT CCG 
Asn Arg His Thr Phe Gln Tyr Arg Val Trp His Phe Val Val Ser Pro
1445 1450 1455 

TCC TTT GAG TAC ACC ATT ATG GCC ATG ATC GCC TTG AAT ACT GTT GTG
Ser Phe Glu Tyr Thr Ile Met Ala Met Ile Ala Leu Asn Thr Val Val
1460 1465 1470 1475

CTG ATG ATG AAG TAT TAT TCT GCT CCC TGT ACC TAT GAG CTG GCC CTG 4638
Leu Met Met Lys Tyr Tyr Ser Ala Pro Cys Thr Tyr Glu Leu Ala Leu
1480 1485 1490

AAG TAC CTG AAT ATC GCC TTC ACC ATG GTG TTT TCC CTG GAA TGT GTC 
Lys Tyr Leu Asn Ile Ala Phe Thr Met Val Phe Ser Leu Glu Cys Val 
1495 1500 1505 

CTG AAG GTC ATC GCT TTT GGC TTT TTG AAC TAT TTC CGA GAC ACC TGG 4734 
Leu Lys Val Ile Ala Phe Gly Phe Leu Asn Tyr Phe Arg Asp Thr Trp 
1510 1515 1520

AAT ATC TTT GAC TTC ATC ACC GTG ATT GGC AGT ATC ACA GAA ATT ATC 
Asn Ile Phe Asp Phe Ile Thr Val Ile Gly Ser Ile Thr Glu Ile Ile
1525 1530 1535 

CTG ACA GAC AGC AAG CTG GTG AAC ACC AGT GGC TTC AAT ATG AGC TTT 4830
Leu Thr Asp Ser Lys Leu Val Asn Thr Ser Gly Phe Asn Met Ser Phe 
1540 1545 1550 1555 

CTG AAG CTC TTC CGA GCT GCC CGC CTC ATA AAG CTC CTG CGT CAG GGC 
Leu Lys Leu Phe Arg Ala Ala Arg Leu Ile Lys Leu Leu Arg Gln Gly
1560 1565 1570 

TAT ACC ATA CGC ATT TTG CTG TGG ACC TTT GTG CAG TCC TTT AAG GCC 4926
Tyr Thr Ile Arg Ile Leu Leu Trp Thr Phe Val Gln Ser Phe Lys Ala
1575 1580 1585 

CTC CCT TAT GTC TGC CTT TTA ATT GCC ATG CTT TTC TTC ATT TAT GCC 4974 
Leu Pro Tyr Val Cys Leu Leu Ile Ala Met Leu Phe Phe Ile Tyr Ala 
1590 1595 1600 

ATC ATT GGG ATG CAG GTA TTT GGA AAC ATA AAA TTA GAC GAG GAG AGT 5022 
Ile Ile Gly Met Gln Val Phe Gly Asn Ile Lys Leu Asp Glu Glu Ser
1605 1610 1615

CAC ATC AAC CGG CAC AAC AAC TTC CGG AGT TTC TTT GGG TCC CTA ATG 5070
His Ile Asn Arg His Asn Asn Phe Arg Ser Phe Phe Gly Ser Leu Met
1620 1625 1630 1635 

CTA CTC TTC AGG AGT GCC ACA GGT GAG GCC TGG CAG GAG ATT ATG CTG 5118 
Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp Gln Glu Ile Met Leu
1640 1645 1650 

TCA TGC CTT GGG GAG AAG GGC TGT GAG CCT GAC ACC ACC GCA CCA TCA 5166 
Ser Cys Leu Gly Glu Lys Gly Cys Glu Pro Asp Thr Thr Ala Pro Ser 
1655 1660 1665

GGG CAG AAC GAG AAT GAA CGC TGC GGC ACC GAT CTG GCC TAC GTG TAC 5214
Gly Gln Asn Glu Asn Glu Arg Cys Gly Thr Asp Leu Ala Tyr Val Tyr
1670 1675 1680 

TTT GTC TCC TTC ATC TTC TTC TGC TCC TTC TTG ATG CTC AAC CTG TTT 5262 
Phe Val Ser Phe Ile Phe Phe Cys Ser Phe Leu Met Leu Asn Leu Phe
1685 1690 1695 

GTG GCC GTC ATC ATG GAC AAC TTT GAG TAC CTG ACT CGG GAC TCC TCC 5310 
Val Ala Val Ile Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser
1700 1705 1710 1715 

ATC CTG GGG CCT CAC CAC TTG GAC GAG TTT GTC CGC GTC TGG GCA GAA 5358 
Ile Leu Gly Pro His His Leu Asp Glu Phe Val Arg Val Trp Ala Glu
1720 1725 1730 

TAT GAC CGA GCA GCA TGT GGC CGC ATC CAT TAC ACT GAG ATG TAT GAA 5406 
Tyr Asp Arg Ala Ala Cys Gly Arg Ile His Tyr Thr Glu Met Tyr Glu
1735 1740 1745 

ATG CTG ACT CTC ATG TCA CCT CCG CTA GGC CTC GGC AAG AGA TGT CCC 5454
Met Leu Thr Leu Met Ser Pro Pro Leu Gly Leu Gly Lys Arg Cys Pro 
1750 1755 1760 

TCC AAA GTG GCA TAT AAG AGG TTG GTC CTG ATG AAC ATG CCA GTA GCT 5502
Ser Lys Val Ala Tyr Lys Arg Leu Val Leu Met Asn Met Pro Val Ala 
1765 1770 1775 

GAG GAC ATG ACG GTC CAC TTC ACC TCC ACA CTT ATG GCT CTG ATC CGG 5550 
Glu Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala Leu Ile Arg
1780 1785 1790 1795 

ACA GCT CTG GAC ATT AAA ATT GCC AAA GGT GGT GCA GAC AGG CAG CAG 5598 
Thr Ala Leu Asp Ile Lys Ile Ala Lys Gly Gly Ala Asp Arg Gln Gln
1800 1805 1810 

CTA GAC TCA GAG CTA CAA AAG GAG ACC CTA GCC ATC TGG CCT CAC CTA 5646 
Leu Asp Ser Glu Leu Gln Lys Glu Thr Leu Ala Ile Trp Pro His Leu
1815 1820 1825 

TCC CAG AAG ATG CTG GAT CTG CTT GTG CCC ATG CCC AAA GCC TCT GAC 5694 
Ser Gln Lys Met Leu Asp Leu Leu Val Pro Met Pro Lys Ala Ser Asp
1830 1835 1840

CTG ACT GTG GGC AAA ATC TAT GCA GCA ATG ATG ATC ATG GAC TAC TAT
Leu Thr Val Gly Lys Ile Tyr Ala Ala Met Met Ile Met Asp Tyr Tyr
1845 1850 1855

AAG CAG AGT AAG GTG AAG AAG CAG AGG CAG CAG CTG GAG GAA CAG AAA
Lys Gln Ser Lys Val Lys Lys Gln Arg Gln Gln Leu Glu Glu Gln Lys
1860 1865 1870

AAT GCC CCC ATG TTC CAG CGC ATG GAG CCT TCA TCT CTG CCT CAG GAG
Asn Ala Pro Met Phe Gln Arg Met Glu Pro Ser Ser Leu Pro Gln Glu
1880 1885 1890

ATC ATT GCT AAT GCC AAA GCC CTG CCT TAC CTC CAG CAG GAC CCC GTT
Ile Ile Ala Asn Ala Lys Ala Leu Pro Tyr Leu Gln Gln Asp Pro Val
1895 1900 1905

TCA GGC CTG AGT GGC CGG AGT GGA TAC CCT TCG ATG AGT CCA CTC TCT
Ser Gly Leu Ser Gly Arg Ser Gly Tyr Pro Ser Met Ser Pro Leu Ser
1910 1915 1920

CCC CAG GAT ATA TTC CAG TTG GCT TGT ATG GAC CCC GCC GAT GAC GGA
Pro Gln Asp Ile Phe Gln Leu Ala Cys Met Asp Pro Ala Asp Asp Gly
1930 1935

CAG TTC CAA GAA CGG CAG TCT CTG GTG GTG ACA GAC CCT AGC TCC ATG
Gln Phe Gln Glu Arg Gln Ser Leu Val Val Thr Asp Pro Ser Ser Met
1940 1945 1950 1955

AGA CGT TCA TTT TCC ACT ATT CGG GAT AAG CGT TCA AAT TCC TCG TGG
Arg Arg Ser Phe Ser Thr Ile Arg Asp Lys Arg Ser Asn Ser Ser Trp
1960 1965 1970

TTG GAG GAA TTC TCC ATG GAG CGA AGC AGT GAA AAT ACC TAC AAG TCC
Leu Glu Glu Phe Ser Met Glu Arg Ser Ser Glu Asn Thr Tyr Lys Ser
1980 1985

CGT CGC CGG AGT TAC CAC TCC TCC TTG CGG CTG TCA GCC CAC CGC CTG 
Arg Arg Arg Ser Tyr His Ser Ser Leu Arg Leu Ser Ala His Arg Leu
1990 1995 2000 

AAC TCT GAT TCA GGC CAC AAG TCT GAC ACT CAC CCC TCA GGG GGC AGG
Asn Ser Asp Ser Gly His Lys Ser Asp Thr His Pro Ser Gly Gly Arg
2005 2010 2015

GAG CGG CGA CGA TCA AAA GAG CGA AAG CAT CTT CTC TCT CCT GAT GTC
Glu Arg Arg Arg Ser Lys Glu Arg Lys His Leu Leu Ser Pro Asp Val
2020 2025 2030 2035

TCC CGC TGC AAT TCA GAA GAG CGA GGG ACC CAG GCT GAC TGG GAG TCC 
Ser Arg Cys Asn Ser Glu Glu Arg Gly Thr Gln Ala Asp Trp Glu Ser 
2040 2045 2050

CCA GAG CGC CGT CAA TCC AGG TCA CCC AGT GAG GGC AGG TCA CAG ACG 
Pro Glu Arg Arg Gln Ser Arg Ser Pro Ser Glu Gly Arg Ser Gln Thr 
2055 2060 2065

CCC AAC AGA CAG GGC ACA GGT TCC CTA AGT GAG AGC TCC ATC CCC TCT 6414
Pro Asn Arg Gln Gly Thr Gly Ser Leu Ser Glu Ser Ser Ile Pro Ser
2070 2075 2080 

GTC TCT GAC ACC AGC ACC CCA AGA AGA AGT CGT CGG CAG CTC CCA CCC 6462 
Val Ser Asp Thr Ser Thr Pro Arg Arg Ser Arg Arg Gln Leu Pro Pro
2085 2090 2095 

GTC CCG CCA AAG CCC CGG CCC CTC CTT TCC TAC AGC TCC CTG ATT CGA 6510 
Val Pro Pro Lys Pro Arg Pro Leu Leu Ser Tyr Ser Ser Leu Ile Arg
2100 2105 2110 2115 

CAC GCG GGC AGC ATC TCT CCA CCT GCT GAT GGA AGC GAG GAG GGC TCC 6558 
His Ala Gly Ser Ile Ser Pro Pro Ala Asp Gly Ser Glu Glu Gly Ser
2120 2125 2130 

CCG CTG ACC TCC CAA GCT CTG GAG AGC AAC AAT GCT TGG CTG ACC GAG 6606 
Pro Leu Thr Ser Gln Ala Leu Glu Ser Asn Asn Ala Trp Leu Thr Glu
2135 2140 2145 

TCT TCC AAC TCT CCG CAC CCC CAG CAG AGG CAA CAT GCC TCC CCA CAG 6654 
Ser Ser Asn Ser Pro His Pro Gln Gln Arg Gln His Ala Ser Pro Gln
2150 2155 2160 

CGC TAC ATC TCC GAG CCC TAC TTG GCC CTG CAC GAA GAC TCC CAC GCC 6702 
Arg Tyr Ile Ser Glu Pro Tyr Leu Ala Leu His Glu Asp Ser His Ala
2165 2170 2175 

TCA GAC TGT GTT GAG GAG GAG ACG CTC ACT TTC GAA GCA GCC GTG GCT 6750
Ser Asp Cys Val Glu Glu Glu Thr Leu Thr Phe Glu Ala Ala Val Ala 
2180 2185 2190 2195 

ACT AGC CTG GGC CGT TCC AAC ACC ATC GGC TCA GCC CCA CCC CTG CGG 6798
Thr Ser Leu Gly Arg Ser Asn Thr Ile Gly Ser Ala Pro Pro Leu Arg
2200 2205 2210 

CAT AGC TGG CAG ATG CCC AAC GGG CAC TAT CGG CGG CGG AGG CGC GGG 6846
His Ser Trp Gln Met Pro Asn Gly His Tyr Arg Arg Arg Arg Arg Gly
2215 2220 2225 

GGG CCT GGG CCA GGC ATG ATG TGT GGG GCT GTC AAC AAC CTG CTA AGT 6894
Gly Pro Gly Pro Gly Met Met Cys Gly Ala Val Asn Asn Leu Leu Ser 
2230 2235 2240 

GAC ACG GAA GAA GAT GAC AAA TGC TAGAGGCTGC TCCCCCCTCC GATGCATGCT 6948 
Asp Thr Glu Glu Asp Asp Lys Cys 
2245 2250 

CTTCTCTCAC ATGGAGAAAA CCAAGACAGA ATTGGGAAGC CAGTGCGGCC CCGCGGGGAG 7008 

GAAGAGGGAA AAGGAAGATG GAAG 7032 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7089 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 166..6978

(D) OTHER INFORMATION: /standard_name= "Alpha-1E-3"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GCTGCTGCTG CCTCTCCGAA GAGCTCGCGG AGCTCCCCAG AGGCGGTGGT CCCCGTGCTT 
GTCTGGATGC GGCTCTGAGT CTCCGTGTGT CTTTCTGCTT GTTGCTGTGT GCGGGTGTTC 
GGCCGCGATC ACCTTTGTGT GTCTTCTGTC TGTTTAAACC TCAGG ATG GCT CGC 

Met Ala Arg 

S 2? S S 55 S. c S S S 5 S 25 S» 2? S - 



GAC CAG AGC AGG AAC CGG CAA GGA ACC CCC GTG CCG GCC TCG GGG CAG
Asp Gln Ser Arg Asn Arg Gln Gly Thr Pro Val Pro Ala Ser Gly Gln

25 30 35 

1 AC ^ ACG *** GCA CAG AGG GCG CGG ACT ATG GCT 

Ala Ala Ala Tyr Lys Gln Thr Lys Ala Gln Arg Ala Arg Thr Met Ala
40 45 50

TTG TAC AAC CCC ATT CCC GTC CGG CAG AAC TGT TTC ACC GTC AAC AGA
Leu Tyr Asn Pro Ile Pro Val Arg Gln Asn Cys Phe Thr Val Asn Arg
55 60 65 

TCC CTG TTC ATC TTC GGA GAA GAT AAC ATT GTC AGG AAA TAT GCC AAG 
Ser Leu Phe Ile Phe Gly Glu Asp Asn Ile Val Arg Lys Tyr Ala Lys

AAG CTC ATC GAT TGG CCG CCA TTT GAG TAC ATG ATC CTG GCC ACC ATC 
Lys Leu Ile Asp Trp Pro Pro Phe Glu Tyr Met Ile Leu Ala Thr Ile

90 95

tTI ^ C l GC A ?° GTC CTG GCC CTG GAG CAT CTT CCT GAG GAT 

Ile Ala Asn Cys Ile Val Leu Ala Leu Glu Gln His Leu Pro Glu Asp
105 110 115

f** ^ CC ATG TCC CGA AGA CTG GAG AAG ACA GAA CCT TAT TTC 

Asp Lys Thr Pro Met Ser Arg Arg Leu Glu Lys Thr Glu Pro Tyr Phe 
120 125 

ATT GGG ATC TTT TGC TTT GAA GCT GGG ATC AAA ATT GTG GCC CTG GGG
Ile Gly Ile Phe Cys Phe Glu Ala Gly Ile Lys Ile Val Ala Leu Gly
135 140 145 

TTC ATC TTC CAT AAG GGC TCT TAC CTC CGC AAT GGC TGG AAT GTC ATG 654 
Phe Ile Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met
150 155 160 

GAC TTC ATC GTG GTC CTC AGT GGC ATC CTG GCC ACT GCA GGA ACC CAC 702 
Asp Phe Ile Val Val Leu Ser Gly Ile Leu Ala Thr Ala Gly Thr His
165 170 175 

TTC AAT ACT CAC GTG GAC CTG AGG ACC CTC CGG GCT GTG CGT GTC CTG 750 
Phe Asn Thr His Val Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu 
180 185 190 195

CGG CCT TTG AAG CTC GTG TCA GGG ATA CCT AGC CTG CAG ATT GTG TTG 798 
Arg Pro Leu Lys Leu Val Ser Gly Ile Pro Ser Leu Gln Ile Val Leu
200 205 210 

AAG TCC ATC ATG AAG GCC ATG GTA CCT CTT CTG CAG ATT GGC CTT CTG 846 
Lys Ser Ile Met Lys Ala Met Val Pro Leu Leu Gln Ile Gly Leu Leu
215 220 225 

CTC TTC TTT GCC ATC CTG ATG TTT GCT ATC ATT GGT TTG GAG TTC TAC 894 
Leu Phe Phe Ala Ile Leu Met Phe Ala Ile Ile Gly Leu Glu Phe Tyr
230 235 240 

AGT GGC AAG TTA CAT CGA GCG TGC TTC ATG AAC AAT TCA GGT ATT CTA 942 
Ser Gly Lys Leu His Arg Ala Cys Phe Met Asn Asn Ser Gly Ile Leu
245 250 255 

GAA GGA TTT GAC CCC CCT CAC CCA TGT GGT GTG CAG GGC TGC CCA GCT 990 
Glu Gly Phe Asp Pro Pro His Pro Cys Gly Val Gln Gly Cys Pro Ala
260 265 270 275 

GGT TAT GAA TGC AAG GAC TGG ATC GGC CCC AAT GAT GGG ATC ACC CAG 1038 
Gly Tyr Glu Cys Lys Asp Trp Ile Gly Pro Asn Asp Gly Ile Thr Gln
280 285 290 

TTT GAT AAC ATC CTT TTT GCT GTG CTG ACT GTC TTC CAG TGC ATC ACC 1086
Phe Asp Asn Ile Leu Phe Ala Val Leu Thr Val Phe Gln Cys Ile Thr
295 300 305 

ATG GAA GGG TGG ACC ACT GTG CTG TAC AAT ACC AAT GAT GCC TTA GGA 1134 
Met Glu Gly Trp Thr Thr Val Leu Tyr Asn Thr Asn Asp Ala Leu Gly 
310 315 320 

GCC ACC TGG AAT TGG CTG TAC TTC ATC CCC CTC ATC ATC ATT GGA TCC 1182 
Ala Thr Trp Asn Trp Leu Tyr Phe Ile Pro Leu Ile Ile Ile Gly Ser
325 330 335 

TTC TTT GTT CTC AAC CTA GTC CTG GGA GTG CTT TCC GGG GAA TTT GCC 1230
Phe Phe Val Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala 
340 345 350 355 

AAA GAG AGA GAG AGA GTG GAG AAC CGA AGG GCT TTC ATG AAG CTG CGG 1278
Lys Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Met Lys Leu Arg
360 365 370 

CGC CAG CAG CAG ATT GAG CGT GAG CTG AAT GGC TAC CGT GCC TGG ATA 1326 
Arg Gln Gln Gln Ile Glu Arg Glu Leu Asn Gly Tyr Arg Ala Trp Ile
375 380 385 

GAC AAA GCA GAG GAA GTC ATG CTC GCT GAA GAA AAT AAA AAT GCT GGA 1374 
Asp Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asn Lys Asn Ala Gly 
390 395 400 

ACA TCC GCC TTA GAA GTG CTT CGA AGG GCA ACC ATC AAG AGG AGC CGG 1422
Thr Ser Ala Leu Glu Val Leu Arg Arg Ala Thr Ile Lys Arg Ser Arg
405 410 415

ACA GAG GCC ATG ACT CGA GAC TCC AGT GAT GAG CAC TGT GTT GAT ATC 
Thr Glu Ala Met Thr Arg Asp Ser Ser Asp Glu His Cys Val Asp Ile 
420 425 430 435 

TCC TCT GTG GGC ACA CCT CTG GCC CGA GCC AGT ATC AAA AGT GCA AAG 
Ser Ser Val Gly Thr Pro Leu Ala Arg Ala Ser Ile Lys Ser Ala Lys 
440 445 450

GTA GAC GGG GTC TCT TAT TTC CGG CAC AAG GAA AGG CTT CTG CGC ATC 
Val Asp Gly Val Ser Tyr Phe Arg His Lys Glu Arg Leu Leu Arg Ile
455 460 465

TCC ATT CGC CAC ATG GTT AAA TCC CAG GTG TTT TAC TGG ATT GTG CTG 
Ser Ile Arg His Met Val Lys Ser Gln Val Phe Tyr Trp Ile Val Leu
470 475 480 

AGC CTT GTG GCA CTC AAC ACT GCC TGT GTG GCC ATT GTC CAT CAC AAC 
Ser Leu Val Ala Leu Asn Thr Ala Cys Val Ala Ile Val His His Asn 
485 490 495

CAG CCC CAG TGG CTC ACC CAC CTC CTC TAC TAT GCA GAA TTT CTG TTT 
Gln Pro Gln Trp Leu Thr His Leu Leu Tyr Tyr Ala Glu Phe Leu Phe
500 505 510 515 

CTG GGA CTC TTC CTC TTG GAG ATG TCC CTG AAG ATG TAT GGC ATG GGG 1758
Leu Gly Leu Phe Leu Leu Glu Met Ser Leu Lys Met Tyr Gly Met Gly 
520 525 530 

CCT CGC CTT TAT TTT CAC TCT TCA TTC AAC TGC TTT GAT TTT GGG GTC 
Pro Arg Leu Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Phe Gly Val 
535 540 545 

ACA GTG GGC AGT ATC TTT GAA GTG GTC TGG GCA ATC TTC AGA CCT GGT 
Thr Val Gly Ser Ile Phe Glu Val Val Trp Ala Ile Phe Arg Pro Gly 
550 555 560 

ACG TCT TTT GGA ATC AGT GTC TTG CGA GCC CTC CGG CTT CTA AGA ATA 1902 
Thr Ser Phe Gly Ile Ser Val Leu Arg Ala Leu Arg Leu Leu Arg Ile
565 570 575 

TTT AAA ATA ACC AAG TAT TGG GCT TCC CTA CGG AAT TTG GTG GTC TCC 1950
Phe Lys Ile Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser
580 585 590 595 

TTG ATG AGC TCA ATG AAG TCT ATC ATC AGT TTG CTT TTC CTC CTC TTC 1998 
Leu Met Ser Ser Met Lys Ser Ile Ile Ser Leu Leu Phe Leu Leu Phe
600 605 610

CTC TTC ATC GTT GTC TTT GCT CTC CTA GGA ATG CAG TTA TTT GGA GGC 2046 
Leu Phe Ile Val Val Phe Ala Leu Leu Gly Met Gln Leu Phe Gly Gly
615 620 625 

AGG TTT AAC TTT AAT GAT GGG ACT CCT TCG GCA AAT TTT GAT ACC TTC 2094 
Arg Phe Asn Phe Asn Asp Gly Thr Pro Ser Ala Asn Phe Asp Thr Phe 
630 635 640

CCT GCA GCC ATC ATG ACT GTG TTC CAG ATC CTG ACG GGT GAG GAC TGG 2142 
Pro Ala Ala Ile Met Thr Val Phe Gln Ile Leu Thr Gly Glu Asp Trp
645 650 655 

AAT GAG GTG ATG TAC AAT GGG ATC CGC TCC CAG GGT GGG GTC AGC TCA 2190 
Asn Glu Val Met Tyr Asn Gly Ile Arg Ser Gln Gly Gly Val Ser Ser
660 665 670 675 

GGC ATG TGG TCT GCC ATC TAC TTC ATT GTG CTC ACC TTG TTT GGC AAC 2238 
Gly Met Trp Ser Ala Ile Tyr Phe Ile Val Leu Thr Leu Phe Gly Asn
680 685 690

TAC ACG CTA CTG AAT GTG TTC TTG GCT ATC GCT GTG GAT AAT CTC GCC 2286 
Tyr Thr Leu Leu Asn Val Phe Leu Ala Ile Ala Val Asp Asn Leu Ala
695 700 705 

AAC GCC CAG GAA CTG ACC AAG GAT GAA CAG GAG GAA GAA GAG GCC TTC 2334 
Asn Ala Gln Glu Leu Thr Lys Asp Glu Gln Glu Glu Glu Glu Ala Phe
710 715 720 

AAC CAG AAA CAT GCA CTG CAG AAG GCC AAG GAG GTC AGC CCG ATG TCT 2382 
Asn Gln Lys His Ala Leu Gln Lys Ala Lys Glu Val Ser Pro Met Ser
725 730 735 

GCA CCC AAC ATG CCT TCG ATC GAA AGA GAC AGA AGG AGA AGA CAC CAC 2430 
Ala Pro Asn Met Pro Ser Ile Glu Arg Asp Arg Arg Arg Arg His His
740 745 750 755 

ATG TCG ATG TGG GAG CCA CGC AGC AGC CAC CTG AGG GAG CGG AGG CGC 2478 
Met Ser Met Trp Glu Pro Arg Ser Ser His Leu Arg Glu Arg Arg Arg 
760 765 770 

CGG CAC CAC ATG TCC GTG TGG GAG CAG CGT ACC AGC CAG CTG AGG AAG 2526 
Arg His His Met Ser Val Trp Glu Gln Arg Thr Ser Gln Leu Arg Lys
775 780 785 

CAC ATG CAG ATG TCC AGC CAG GAG GCC CTC AAC AGA GAG GAG GCG CCG 2574 
His Met Gln Met Ser Ser Gln Glu Ala Leu Asn Arg Glu Glu Ala Pro
790 795 800 

ACC ATG AAC CCG CTC AAC CCC CTC AAC CCG CTC AGC TCC CTC AAC CCG 2622
Thr Met Asn Pro Leu Asn Pro Leu Asn Pro Leu Ser Ser Leu Asn Pro
805 810 815 

CTC AAT GCC CAC CCC AGC CTT TAT CGG CGA CCC AGG GCC ATT GAG GGC 2670 
Leu Asn Ala His Pro Ser Leu Tyr Arg Arg Pro Arg Ala Ile Glu Gly
820 825 830 835 

CTG GCC CTG GGC CTG GCC CTG GAG AAG TTC GAG GAG GAG CGC ATC AGC 2718 
Leu Ala Leu Gly Leu Ala Leu Glu Lys Phe Glu Glu Glu Arg Ile Ser
840 845 850 

CGT GGG GGG TCC CTC AAG GGG GAT GGA GGG GAC CGA TCC AGT GCC CTG 2766 
Arg Gly Gly Ser Leu Lys Gly Asp Gly Gly Asp Arg Ser Ser Ala Leu 
855 860 865 

GAC AAC CAG AGG ACC CCT TTG TCC CTG GGC CAG CGG GAG CCA CCA TGG 2814 
Asp Asn Gln Arg Thr Pro Leu Ser Leu Gly Gln Arg Glu Pro Pro Trp
870 875 880 

CTG GCC AGG CCC TGT CAT GGA AAC TGT GAC CCG ACT CAG CAG GAG GCA 2862 
Leu Ala Arg Pro Cys His Gly Asn Cys Asp Pro Thr Gln Gln Glu Ala
885 890 895 

GGG GGA GGA GAG GCT GTG GTG ACC TTT GAG GAC CGG GCC AGG CAC AGG 2910 
Gly Gly Gly Glu Ala Val Val Thr Phe Glu Asp Arg Ala Arg His Arg 
900 905 910 915 

CAG AGC CAA CGG CGC AGC CGG CAT CGC CGC GTC AGG ACA GAA GGC AAG 2958 
Gln Ser Gln Arg Arg Ser Arg His Arg Arg Val Arg Thr Glu Gly Lys
920 925 930 

GAG TCC TCT TCA GCC TCC CGG AGC AGG TCT GCC AGC CAG GAA CGC AGT 3006 
Glu Ser Ser Ser Ala Ser Arg Ser Arg Ser Ala Ser Gln Glu Arg Ser
935 940 945 

CTG GAT GAA GCC ATG CCC ACT GAA GGG GAG AAG GAC CAT GAG CTC AGG 3054 
Leu Asp Glu Ala Met Pro Thr Glu Gly Glu Lys Asp His Glu Leu Arg 
950 955 960



GGC AAC CAT GGT GCC AAG GAG CCA ACG ATC CAA GAA GAG AGA GCC CAG 3102
Gly Asn His Gly Ala Lys Glu Pro Thr Ile Gln Glu Glu Arg Ala Gln
965 970 975

GAT TTA AGG AGG ACC AAC AGT CTG ATG GTG TCC AGA GGC TCC GGG CTG 3150 
Asp Leu Arg Arg Thr Asn Ser Leu Met Val Ser Arg Gly Ser Gly Leu 
980 985 990 995

GCA GGA GGC CTT GAT GAG GCT GAC ACC CCC CTA GTC CTG CCC CAT CCT 3198 
Ala Gly Gly Leu Asp Glu Ala Asp Thr Pro Leu Val Leu Pro His Pro 
1000 1005 1010 

GAG CTG GAA GTG GGG AAG CAC GTG GTG CTG ACG GAG CAG GAG CCA GAA 3246 
Glu Leu Glu Val Gly Lys His Val Val Leu Thr Glu Gln Glu Pro Glu
1015 1020 1025 

GGC AGC AGT GAG CAG GCC CTG CTG GGG AAT GTG CAG CTA GAC ATG GGC 3294
Gly Ser Ser Glu Gln Ala Leu Leu Gly Asn Val Gln Leu Asp Met Gly
1030 1035 1040 

CGG GTC ATC AGC CAG AGC GAG CCT GAC CTC TCC TGC ATC ACG GCC AAC 3342
Arg Val Ile Ser Gln Ser Glu Pro Asp Leu Ser Cys Ile Thr Ala Asn
1045 1050 1055 

ACG GAC AAG GCC ACC ACC GAG AGC ACC AGC GTC ACC GTC GCC ATC CCC 3390 
Thr Asp Lys Ala Thr Thr Glu Ser Thr Ser Val Thr Val Ala Ile Pro
1060 1065 1070 1075 

GAC GTG GAC CCC TTG GTG GAC TCA ACC GTG GTG CAC ATT AGC AAC AAG 3438 
Asp Val Asp Pro Leu Val Asp Ser Thr Val Val His Ile Ser Asn Lys
1080 1085 1090 

ACG GAT GGG GAA GCC AGT CCC TTG AAG GAG GCA GAG ATC AGA GAG GAT 3486
Thr Asp Gly Glu Ala Ser Pro Leu Lys Glu Ala Glu Ile Arg Glu Asp
1095 1100 1105

GAG GAG GAG GTG GAG AAG AAG AAG CAG AAG AAG GAG AAG CGT GAG ACA 3534 
Glu Glu Glu Val Glu Lys Lys Lys Gln Lys Lys Glu Lys Arg Glu Thr
1110 1115 1120 

GGC AAA GCC ATG GTG CCC CAC AGC TCA ATG TTC ATC TTC AGC ACC ACC 3582 
Gly Lys Ala Met Val Pro His Ser Ser Met Phe Ile Phe Ser Thr Thr
1125 1130 1135

AAC CCG ATC CGG AGG GCC TGC CAC TAC ATC GTG AAC CTG CGC TAC TTT 3630
Asn Pro Ile Arg Arg Ala Cys His Tyr Ile Val Asn Leu Arg Tyr Phe
1140 1145 1150 1155

GAG ATG TGC ATC CTC CTG GTG ATT GCA GCC AGC AGC ATC GCC CTG GCG 3678 
Glu Met Cys Ile Leu Leu Val Ile Ala Ala Ser Ser Ile Ala Leu Ala
1160 1165 1170

GCA GAG GAC CCC GTC CTG ACC AAC TCG GAG CGC AAC AAA GTC CTG AGG 3726 
Ala Glu Asp Pro Val Leu Thr Asn Ser Glu Arg Asn Lys Val Leu Arg 
1175 1180 1185

TAT TTT GAC TAT GTG TTC ACG GGC GTG TTC ACC TTT GAG ATG GTT ATA 3774 
Tyr Phe Asp Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val Ile
1190 1195 1200

AAG ATG ATA GAC CAA GGC TTG ATC CTG CAG GAT GGG TCC TAC TTC CGA 3822
Lys Met Ile Asp Gln Gly Leu Ile Leu Gln Asp Gly Ser Tyr Phe Arg
1205 1210 1215 

GAC TTG TGG AAC ATC CTG GAC TTT GTG GTG GTC GTT GGC GCA TTG GTG 3870
Asp Leu Trp Asn Ile Leu Asp Phe Val Val Val Val Gly Ala Leu Val
1220 1225 1230 1235 

GCC TTT GCT CTG GCG AAC GCT TTG GGA ACC AAC AAA GGA CGG GAC ATC 3918 
Ala Phe Ala Leu Ala Asn Ala Leu Gly Thr Asn Lys Gly Arg Asp Ile
1240 1245 1250 

AAG ACC ATC AAG TCT CTG CGG GTG CTC CGA GTT CTA AGG CCA CTG AAA 3966
Lys Thr Ile Lys Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys
1255 1260 1265

ACC ATC AAG CGC TTG CCC AAG CTC AAG GCC GTC TTC GAC TGC GTA GTG
Thr Ile Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val
1270 1275 1280

ACC TCC TTG AAG AAT GTC TTC AAC ATA CTC ATT GTG TAC AAG CTC TTC 

Thr Ser Leu Lys Asn Val Phe Asn Ile Leu Ile Val Tyr Lys Leu Phe
1285 1290 1295 

ATG TTC ATC TTT GCT GTC ATC GCA GTT CAG CTC TTC AAG GGA AAG TTC 
Met Phe Ile Phe Ala Val Ile Ala Val Gln Leu Phe Lys Gly Lys Phe
1300 1305 1310 1315

TTT TAT TGC ACG GAC AGT TCC AAG GAC ACA GAG AAG GAG TGC ATA GGC 
Phe Tyr Cys Thr Asp Ser Ser Lys Asp Thr Glu Lys Glu Cys Ile Gly
1320 1325 1330

AAC TAT GTA GAT CAC GAG AAA AAC AAG ATG GAG GTG AAG GGC CGG GAA 
Asn Tyr Val Asp His Glu Lys Asn Lys Met Glu Val Lys Gly Arg Glu 
1335 1340 1345 

TGG AAG CGC CAT GAA TTC CAC TAC GAC AAC ATT ATC TGG GCC CTG CTG
Trp Lys Arg His Glu Phe His Tyr Asp Asn Ile Ile Trp Ala Leu Leu
1350 1355 1360

ACC CTC TTC ACC GTC TCC ACA GGG GAA GGA TGG CCT CAA GTT CTG CAG 
Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Gln Val Leu Gln
1365 1370 1375

CAC TCT GTA GAT GTG ACA GAG GAA GAC CGA GGC CCA AGC CGC AGC AAC 
His Ser Val Asp Val Thr Glu Glu Asp Arg Gly Pro Ser Arg Ser Asn
1380 1385 1390 1395 

CGC ATG GAG ATG TCT ATC TTT TAT GTA GTC TAC TTT GTG GTC TTC CCC 
Arg Met Glu Met Ser Ile Phe Tyr Val Val Tyr Phe Val Val Phe Pro
1400 1405 1410 

TTC TTC TTT GTC AAT ATC TTT GTG GCT CTC ATC ATC ATC ACC TTC CAG 
Phe Phe Phe Val Asn Ile Phe Val Ala Leu Ile Ile Ile Thr Phe Gln
1415 1420 1425 

GAG CAA GGG GAT AAG ATG ATG GAG GAG TGC AGC CTG GAG AAG AAT GAG 
Glu Gln Gly Asp Lys Met Met Glu Glu Cys Ser Leu Glu Lys Asn Glu
1430 1435 1440 

AGG GCG TGC ATC GAC TTC GCC ATC AGC GCC AAA CCT CTC ACC CGC TAC 

Arg Ala Cys Ile Asp Phe Ala Ile Ser Ala Lys Pro Leu Thr Arg Tyr

1445 1450 1455 

ATG CCG CAG AAC AGA CAC ACC TTC CAG TAC CGC GTG TGG CAC TTT GTG 

Met Pro Gln Asn Arg His Thr Phe Gln Tyr Arg Val Trp His Phe Val
1460 1465 1470 1475

GTG TCT CCG TCC TTT GAG TAC ACC ATT ATG GCC ATG ATC GCC TTG AAT
Val Ser Pro Ser Phe Glu Tyr Thr Ile Met Ala Met Ile Ala Leu Asn
1480 1485 1490 

ACT GTT GTG CTG ATG ATG AAG TAT TAT TCT GCT CCC TGT ACC TAT GAG 4686
Thr Val Val Leu Met Met Lys Tyr Tyr Ser Ala Pro Cys Thr Tyr Glu 
1495 1500 1505 

CTG GCC CTG AAG TAC CTG AAT ATC GCC TTC ACC ATG GTG TTT TCC CTG 4734 
Leu Ala Leu Lys Tyr Leu Asn Ile Ala Phe Thr Met Val Phe Ser Leu
1510 1515 1520 

GAA TGT GTC CTG AAG GTC ATC GCT TTT GGC TTT TTG AAC TAT TTC CGA 4782
Glu Cys Val Leu Lys Val Ile Ala Phe Gly Phe Leu Asn Tyr Phe Arg
1525 1530 1535 

GAC ACC TGG AAT ATC TTT GAC TTC ATC ACC GTG ATT GGC AGT ATC ACA 4830
Asp Thr Trp Asn Ile Phe Asp Phe Ile Thr Val Ile Gly Ser Ile Thr
1540 1545 1550 1555 

GAA ATT ATC CTG ACA GAC AGC AAG CTG GTG AAC ACC AGT GGC TTC AAT 4878
Glu Ile Ile Leu Thr Asp Ser Lys Leu Val Asn Thr Ser Gly Phe Asn
1560 1565 1570 

ATG AGC TTT CTG AAG CTC TTC CGA GCT GCC CGC CTC ATA AAG CTC CTG 4926
Met Ser Phe Leu Lys Leu Phe Arg Ala Ala Arg Leu Ile Lys Leu Leu
1575 1580 1585 

CGT CAG GGC TAT ACC ATA CGC ATT TTG CTG TGG ACC TTT GTG CAG TCC 4974 
Arg Gln Gly Tyr Thr Ile Arg Ile Leu Leu Trp Thr Phe Val Gln Ser
1590 1595 1600 

TTT AAG GCC CTC CCT TAT GTC TGC CTT TTA ATT GCC ATG CTT TTC TTC 5022 
Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu Ile Ala Met Leu Phe Phe
1605 1610 1615 

ATT TAT GCC ATC ATT GGG ATG CAG GTA TTT GGA AAC ATA AAA TTA GAC 5070 
Ile Tyr Ala Ile Ile Gly Met Gln Val Phe Gly Asn Ile Lys Leu Asp
1620 1625 1630 1635 

GAG GAG AGT CAC ATC AAC CGG CAC AAC AAC TTC CGG AGT TTC TTT GGG 5118 
Glu Glu Ser His Ile Asn Arg His Asn Asn Phe Arg Ser Phe Phe Gly
1640 1645 1650 

TCC CTA ATG CTA CTC TTC AGG AGT GCC ACA GGT GAG GCC TGG CAG GAG 5166 
Ser Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp Gln Glu
1655 1660 1665 

ATT ATG CTG TCA TGC CTT GGG GAG AAG GGC TGT GAG CCT GAC ACC ACC 5214 
Ile Met Leu Ser Cys Leu Gly Glu Lys Gly Cys Glu Pro Asp Thr Thr
1670 1675 1680

GCA CCA TCA GGG CAG AAC GAG AAT GAA CGC TGC GGC ACC GAT CTG GCC 5262 
Ala Pro Ser Gly Gln Asn Glu Asn Glu Arg Cys Gly Thr Asp Leu Ala
1685 1690 1695 

TAC GTG TAC TTT GTC TCC TTC ATC TTC TTC TGC TCC TTC TTG ATG CTC 5310
Tyr Val Tyr Phe Val Ser Phe Ile Phe Phe Cys Ser Phe Leu Met Leu
1700 1705 1710 1715 

AAC CTG TTT GTG GCC GTC ATC ATG GAC AAC TTT GAG TAC CTG ACT CGG 
Asn Leu Phe Val Ala Val Ile Met Asp Asn Phe Glu Tyr Leu Thr Arg
1720 1725 1730 

GAC TCC TCC ATC CTG GGG CCT CAC CAC TTG GAC GAG TTT GTC CGC GTC 
Asp Ser Ser Ile Leu Gly Pro His His Leu Asp Glu Phe Val Arg Val
1735 1740 1745 

TGG GCA GAA TAT GAC CGA GCA GCA TGT GGC CGC ATC CAT TAC ACT GAG 5454 
Trp Ala Glu Tyr Asp Arg Ala Ala Cys Gly Arg Ile His Tyr Thr Glu
1750 1755 1760

ATG TAT GAA ATG CTG ACT CTC ATG TCA CCT CCG CTA GGC CTC GGC AAG 

Met Tyr Glu Met Leu Thr Leu Met Ser Pro Pro Leu Gly Leu Gly Lys
1765 1770 1775

AGA TGT CCC TCC AAA GTG GCA TAT AAG AGG TTG GTC CTG ATG AAC ATG 
Arg Cys Pro Ser Lys Val Ala Tyr Lys Arg Leu Val Leu Met Asn Met 
1780 1785 1790 1795 

CCA GTA GCT GAG GAC ATG ACG GTC CAC TTC ACC TCC ACA CTT ATG GCT 
Pro Val Ala Glu Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala 
1800 1805 1810 

CTG ATC CGG ACA GCT CTG GAC ATT AAA ATT GCC AAA GGT GGT GCA GAC 5646 
Leu He Arg Thr Ala Leu Asp He Lys He Ala Lys Gly Gly Ala Asp ~ 
1815 1820 1825 

AGG CAG CAG CTA GAC TCA GAG CTA CAA AAG GAG ACC CTA GCC ATC TGG 
Arg Gin Gin Leu Asp Ser Glu Leu Gin Lys Glu Thr Leu Ala He Trp 
1830 1835 1840 

CCT CAC CTA TCC CAG AAG ATG CTG GAT CTG CTT GTG CCC ATG CCC AAA 5742 
Pro His Leu Ser Gin Lys Met Leu Asp Leu Leu Val Pro Met Pro Lvs 
1845 1850 1855 

GCC TCT GAC CTG ACT GTG GGC AAA ATC TAT GCA GCA ATG ATG ATC ATG 
Ala Ser Asp Leu Thr Val Gly Lys He Tyr Ala Ala Met Met He Met 
1860 1865 1870 1875 

GAC TAC TAT AAG CAG AGT AAG GTG AAG AAG CAG AGG CAG CAG CTG GAG 5838 
Asp Tyr Tyr Lys Gin Ser Lys Val Lys Lys Gin Arg Gin Gin Leu Glu 
1880 1885 1890 

GAA CAG AAA AAT GCC CCC ATG TTC CAG CGC ATG GAG CCT TCA TCT CTG 5886 
Glu Gin Lys Asn Ala Pro Met Phe Gin Arg Met Glu Pro Ser Ser Leu 
1895 1900 1905 

CCT CAG GAG ATC ATT GCT AAT GCC AAA GCC CTG CCT TAC CTC CAG CAG 5 934 

Pro Gin Glu He He Ala Asn Ala Lys Ala Leu Pro Tyr Leu Gin Gin 
1910 1915 1920 

GAC CCC GTT TCA GGC CTG AGT GGC CGG AGT GGA TAC CCT TCG ATG AGT 5982 



5694 



5790 
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Asp Pro Val Ser Gly Leu Ser Gly Arg Ser Gly Tyr Pro Ser Met Ser 
1925 1930 1935 

CCA CTC TCT CCC CAG GAT ATA TTC CAG TTG GCT TGT ATG GAC CCC GCC 6030
Pro Leu Ser Pro Gln Asp Ile Phe Gln Leu Ala Cys Met Asp Pro Ala
1940 1945 1950 1955 

GAT GAC GGA CAG TTC CAA GAA CGG CAG TCT CTG GTG GTG ACA GAC CCT 6078
Asp Asp Gly Gln Phe Gln Glu Arg Gln Ser Leu Val Val Thr Asp Pro
1960 1965 1970 

AGC TCC ATG AGA CGT TCA TTT TCC ACT ATT CGG GAT AAG CGT TCA AAT 6126 
Ser Ser Met Arg Arg Ser Phe Ser Thr Ile Arg Asp Lys Arg Ser Asn
1975 1980 1985 

TCC TCG TGG TTG GAG GAA TTC TCC ATG GAG CGA AGC AGT GAA AAT ACC 6174 
Ser Ser Trp Leu Glu Glu Phe Ser Met Glu Arg Ser Ser Glu Asn Thr 
1990 1995 2000 

TAC AAG TCC CGT CGC CGG AGT TAC CAC TCC TCC TTG CGG CTG TCA GCC 6222 
Tyr Lys Ser Arg Arg Arg Ser Tyr His Ser Ser Leu Arg Leu Ser Ala
2005 2010 2015

CAC CGC CTG AAC TCT GAT TCA GGC CAC AAG TCT GAC ACT CAC CCC TCA 6270
His Arg Leu Asn Ser Asp Ser Gly His Lys Ser Asp Thr His Pro Ser
2020 2025 2030 2035 

GGG GGC AGG GAG CGG CGA CGA TCA AAA GAG CGA AAG CAT CTT CTC TCT 6318 
Gly Gly Arg Glu Arg Arg Arg Ser Lys Glu Arg Lys His Leu Leu Ser 
2040 2045 2050 

CCT GAT GTC TCC CGC TGC AAT TCA GAA GAG CGA GGG ACC CAG GCT GAC 6366 
Pro Asp Val Ser Arg Cys Asn Ser Glu Glu Arg Gly Thr Gln Ala Asp
2055 2060 2065 

TGG GAG TCC CCA GAG CGC CGT CAA TCC AGG TCA CCC AGT GAG GGC AGG 6414 
Trp Glu Ser Pro Glu Arg Arg Gln Ser Arg Ser Pro Ser Glu Gly Arg
2070 2075 2080 

TCA CAG ACG CCC AAC AGA CAG GGC ACA GGT TCC CTA AGT GAG AGC TCC 6462 
Ser Gln Thr Pro Asn Arg Gln Gly Thr Gly Ser Leu Ser Glu Ser Ser
2085 2090 2095 

ATC CCC TCT GTC TCT GAC ACC AGC ACC CCA AGA AGA AGT CGT CGG CAG 6510 
Ile Pro Ser Val Ser Asp Thr Ser Thr Pro Arg Arg Ser Arg Arg Gln
2100 2105 2110 2115 

CTC CCA CCC GTC CCG CCA AAG CCC CGG CCC CTC CTT TCC TAC AGC TCC 6558 
Leu Pro Pro Val Pro Pro Lys Pro Arg Pro Leu Leu Ser Tyr Ser Ser 
2120 2125 2130 

CTG ATT CGA CAC GCG GGC AGC ATC TCT CCA CCT GCT GAT GGA AGC GAG 6606
Leu Ile Arg His Ala Gly Ser Ile Ser Pro Pro Ala Asp Gly Ser Glu
2135 2140 2145 

GAG GGC TCC CCG CTG ACC TCC CAA GCT CTG GAG AGC AAC AAT GCT TGG 6654
Glu Gly Ser Pro Leu Thr Ser Gln Ala Leu Glu Ser Asn Asn Ala Trp
2150 2155 2160

CTG ACC GAG TCT TCC AAC TCT CCG CAC CCC CAG CAG AGG CAA CAT GCC 6702
Leu Thr Glu Ser Ser Asn Ser Pro His Pro Gln Gln Arg Gln His Ala
2165 2170 2175

Ser Pro Sn Sf £5° t? C TCC GAG CCC TAC GCC CTG CAC GAA GAC 6750
Ser Pro Gln Arg Tyr Ile Ser Glu Pro Tyr Leu Ala Leu His Glu Asp
2180 2185 2190 2195

S 2S S S £ £ 55 £° S £2 SSS E Si E "»
2200 2205 2210

vl? a CT £° T AGC CTG GGC CGT TCC ^ ATC GGC TCA GCC CCA 6846
Ala Val Ala Thr Ser Leu Gly Arg Ser Asn Thr Ile Gly Ser Ala Pro
2215 2220 2225

CCC CTG CGG CAT AGC TGG CAG ATG CCC AAC GGG CAC TAT CGG CGG rar 6894
Pro Leu Arg His Ser Trp Gln Met Pro Asn Gly His Tyr Arg Arg Arg
2230 2235 2240

AGG CG f GGG GGG CCT GGG CCA GGC ATG ATG TGT GGG GCT GTC AAC AAC 6942
Arg Arg Gly Gly Pro Gly Pro Gly Met Met Cys Gly Ala Val Asn Asn
2245 2250 2255

SS 25 22 2S S Si SS 21 £5 TMMSCTCC «»»«
2260 2265 2270

G ATG C ATGCT CTTCTCTCAC ATGGAGAAAA CCAAGACAGA ATTGGGAAGC CAGTGCGGCC 7055
CCGCGGGGAG GAAGAGGGAA AAGGAAGATG GAAG 7089

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2634 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1983

(D) OTHER INFORMATION: /standard_name= "Beta-2d"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:

vl? GA ° ATG TCC ^ TCT CCT CCC ACA CCG Q CG GCG GCG 48
Met Val Gln Arg Asp Met Ser Lys Ser Pro Pro Thr Pro Ala Ala Ala
1 5 10 15

GTG GCG CAG GAG ATC CAG ATG GAA CTG CTA GAG AAC GTG GCT CCC GCG 96 
Val Ala Gln Glu Ile Gln Met Glu Leu Leu Glu Asn Val Ala Pro Ala
20 25 30 

GGG GCG CTC GGA GCC GCC GCA CAG TCA TAT GGA AAA GGA GCC AGA AGG 144 
Gly Ala Leu Gly Ala Ala Ala Gln Ser Tyr Gly Lys Gly Ala Arg Arg
35 40 45 

AAA AAC AGA TTT AAA GGA TCT GAT GGA AGC ACG TCA TCT GAT ACT ACC 192 
Lys Asn Arg Phe Lys Gly Ser Asp Gly Ser Thr Ser Ser Asp Thr Thr 
50 55 60

TCA AAT AGT TTT GTT CGC CAG GGT TCG GCA GAC TCC TAC ACT AGC CGT 240
Ser Asn Ser Phe Val Arg Gln Gly Ser Ala Asp Ser Tyr Thr Ser Arg
65 70 75 80 

CCA TCC GAT TCC GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG CGC 288 
Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg 
85 90 95 

AGA GAA GCG GAG CGG CAG GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA 336 
Arg Glu Ala Glu Arg Gln Ala Gln Ala Gln Leu Glu Lys Ala Lys Thr
100 105 110

AAG CCC GTT GCA TTT GCG GTT CGG ACA AAT GTC AGC TAC AGT GCG GCC 384 
Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala 
115 120 125 

CAT GAA GAT GAT GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA 432 
His Glu Asp Asp Val Pro Val Pro Gly Met Ala Ile Ser Phe Glu Ala
130 135 140

AAA GAT TTT CTG CAT GTT AAG GAA AAA TTT AAC AAT GAC TGG TGG ATA 480
Lys Asp Phe Leu His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp Ile
145 150 155 160 

GGG CGA TTG GTA AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA 528 
Gly Arg Leu Val Lys Glu Gly Cys Glu Ile Gly Phe Ile Pro Ser Pro
165 170 175

GTC AAA CTA GAA AAC ATG AGG CTG CAG CAT GAA CAG AGA GCC AAG CAA 576 
Val Lys Leu Glu Asn Met Arg Leu Gln His Glu Gln Arg Ala Lys Gln
180 185 190 

GGG AAA TTC TAC TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG 624 
Gly Lys Phe Tyr Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu 
195 200 205 

GGT GAC ATA GTA CCT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT GCT 672 
Gly Asp Ile Val Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala
210 215 220 

ATA GAC ATA GAT GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA 720
Ile Asp Ile Asp Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp Ile Pro
225 230 235 240

K ^ ^ * ° TCC CCT *** CCC AGT GCA AAC A GT GTA ACG TCA CCC 768
Ala Asn His Arg Ser Pro Lys Pro Ser Ala Asn Ser Val Thr Ser Pro
245 250 255

CAC TCC AAA GAG AAA AGA ATG CCC TTC TTT AAG AAG ACA GAG CAC ACT 816
His Ser Lys Glu Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr
260 265 270

pS Pro Tvr SI SI? ^ TA ^ CT ^ CC ATG CGA CCA GTG G ^C CTA GTG GGC 864
Pro Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val Val Leu Val Gly
275 280 285

CCT TCT CTG AAG GGC TAC GAG GTC ACA GAT ATG ATG CAA AAA GCG CTG 912
Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gln Lys Ala Leu
290 295 300

HI GAT TTT TTA AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG 960
Phe Asp Phe Leu Lys His Arg Phe Glu Gly Arg Ile Ser Ile Thr Arg
305 310 315 320

55 i£ SI tT C I CG CTT GCC *** CGC TCG GTA AAC AAT CCC 1008
Val Thr Ala Asp Ile Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro
325 330 335

Ser JJS 2s Si tl A tT A ^ AGA TCC ^ ACA AGG TCA AGC TTA GCG 1056
Ser Lys His Ala Ile Ile Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala
340 345 350

GAA GTT CAG AGT GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG 1104
Glu Val Gln Ser Glu Ile Glu Arg Ile Phe Glu Leu Ala Arg Thr Leu
355 360 365

CAG TTG GTG GTC CTT GAC GCG GAT ACA ATT AAT CAT CCA GCT CAA CTC 1152
Gln Leu Val Val Leu Asp Ala Asp Thr Ile Asn His Pro Ala Gln Leu
370 375 380

™l AAA ACC TCC TTG GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT 1200
Ser Lys Thr Ser Leu Ala Pro Ile Ile Val Tyr Val Lys Ile Ser Ser
385 390 395 400

TTA ^ AGG TTA ATA ^ TCT CGA GGG AAA TCT CAA GCT 1248
Pro Lys Val Leu Gln Arg Leu Ile Lys Ser Arg Gly Lys Ser Gln Ala
405 410 415

AAA CAC CTC AAC GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT 1296
Lys His Leu Asn Val Gln Met Val Ala Ala Asp Lys Leu Ala Gln Cys
420 425 430

CCT CCA GAG CTG TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT 1344
Pro Pro Glu Leu Phe Asp Val Ile Leu Asp Glu Asn Gln Leu Glu Asp
435 440 445

GCC TGT GAG CAC CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC 1392
Ala Cys Glu His Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr
450 455 460

CAT CCT CCC AGC AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA 1440 
His Pro Pro Ser Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu 
465 470 475 480 

GCC ACT TCA AGT CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG 1488 
Ala Thr Ser Ser Leu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gln
485 490 495 

GGT TCT CAA GGT GAT CAG AGG ACT GAT CGC TCC GCT CCT ATC CGT TCT 1536 
Gly Ser Gln Gly Asp Gln Arg Thr Asp Arg Ser Ala Pro Ile Arg Ser
500 505 510

GCT TCC CAA GCT GAA GAA GAA CCT AGT GTG GAA CCA GTC AAG AAA TCC 1584 
Ala Ser Gln Ala Glu Glu Glu Pro Ser Val Glu Pro Val Lys Lys Ser
515 520 525 

CAG CAC CGC TCT TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT GGG 1632 
Gln His Arg Ser Ser Ser Ser Ala Pro His His Asn His Arg Ser Gly
530 535 540

ACA AGT CGC GGC CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG 1680 
Thr Ser Arg Gly Leu Ser Arg Gln Glu Thr Phe Asp Ser Glu Thr Gln
545 550 555 560

GAG AGT CGA GAC TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT 1728 
Glu Ser Arg Asp Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His 
565 570 575

GAC CAC GTG GAC CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC AGA GAC 1776 
Asp His Val Asp His Tyr Ala Ser His Arg Asp His Asn His Arg Asp 
580 585 590 

GAG ACC CAC GGG AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT 1824 
Glu Thr His Gly Ser Ser Asp His Arg His Arg Glu Ser Arg His Arg 
595 600 605

TCC CGG GAC GTG GAT CGA GAG CAG GAC CAC AAC GAG TGC AAC AAG CAG 1872 
Ser Arg Asp Val Asp Arg Glu Gln Asp His Asn Glu Cys Asn Lys Gln
610 615 620

CGC AGC CGT CAT AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA 1920
Arg Ser Arg His Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu
625 630 635 640 

GTG ATA TCA AAA AAA CGG AAT GAG GCT GGG GAG TGG AAC AGG GAT GTT 1968 
Val Ile Ser Lys Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val
645 650 655 

TAC ATC CCC CAA TGAGTTTTGC CCTTTTGTGT TTTTTTTTTT TTTTTTTTGA 2020
Tyr Ile Pro Gln
660 

AGTCTTGTAT AACTAACAGC ATCCCCAAAA CAAAAAGTCT TTGGGGTCTA CACTGCAATC 2080
ATATGTGATC TGTCTTGTAA TATTTTGTAT TATTGCTGTT GCTTGAATAG CAATAGCATG 2140
GATAGAGTAT TGAGATACTT TTTCTTTTGT AAGTGCTACA TAAATTGGCC TGGTATGGCT 2200
GCAGTCCTCC GGTTGCATAC TGGACTCTTC AAAAACTGTT TTGGGTAGCT GCCACTTGAA 2260
CAAAATCTGT TGCCACCCAG GTGATGTTAG TGTTTTAAGA AATGTAGTTG ATGTATCCAA 2320
CAAGCCAGAA TCAGCACAGA TAAAAAGTGG AATTTCTTGT TTCTCCAGAT TTTTAATACG 2380
TTAATACGCA GGCATCTGAT TTGCATATTC ATTCATGGAC CACTGTTTCT TGCTTGTACC 2440
TCTGGCTGAC TAAATTTGGG GACAGATTCA GTCTTGCCTT ACACAAAGGG GATCATAAAG 2500
TTAGAATCTA TTTTCTATGT ACTAGTACTG TGTACTGTAT AGACAGTTTG TAAATGTTAT 2560
TTCTGCAAAC AAACACCTCC TTATTATATA TAATATATAT ATATATATCA GTTTGATCAC 2620
ACTATTTTAG AGTC 2634

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1823 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 69..1631

(D) OTHER INFORMATION: /standard_name= "Beta-4"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:

AGCCCAGCCT CGGGGGCCAG CCCCCTCCGC CCACCGCACA CGGGCTGGCC ATGCGGCGGC 60

TCTGAACG ATG TCC TCC TCC TCC TAC GCC AAG AAC GGG ACC GCG GAC GGG 110
Met Ser Ser Ser Ser Tyr Ala Lys Asn Gly Thr Ala Asp Gly
1 5 10

CCG CAC TCC CCC ACC TCG CAG GTG GCC CGA GGC ACC ACA ACC CGG AGG 158
Pro His Ser Pro Thr Ser Gln Val Ala Arg Gly Thr Thr Thr Arg Arg
15 20 25 30

AGC AGG TTG AAA AGA TCC GAT GGC AGC ACC ACT TCG ACC AGC TTC ATC 206
Ser Arg Leu Lys Arg Ser Asp Gly Ser Thr Thr Ser Thr Ser Phe Ile
35 40 45

5£ £™ ^ 1°* G ? G GAT TCC TAC ACA AGC AGG CCG TCT GAC TCC 254
Leu Arg Gln Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser
50 55 60

GAT GTC TCT TTG GAA GAG GAC CGG GAA GCA ATT CGA CAG GAG AGA GAA 302
Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Ile Arg Gln Glu Arg Glu
65 70 75

CAG CAA GCA GCT ATC CAG CTT GAG AGA GCA AAG TCC AAA CCT GTA GCA 350 
Gln Gln Ala Ala Ile Gln Leu Glu Arg Ala Lys Ser Lys Pro Val Ala
80 85 90 

TTT GCC GTG AAG ACA AAT GTG AGC TAC TGC GGC GCC CTG GAC GAG GAT 398 
Phe Ala Val Lys Thr Asn Val Ser Tyr Cys Gly Ala Leu Asp Glu Asp 
95 100 105 110

GTG CCT GTT CCA AGC ACA GCT ATC TCC TTT GAT GCT AAA GAC TTT CTA 446 
Val Pro Val Pro Ser Thr Ala Ile Ser Phe Asp Ala Lys Asp Phe Leu
115 120 125 

CAT ATT AAA GAG AAA TAT AAC AAT GAT TGG TGG ATA GGA AGG CTG GTG 494
His Ile Lys Glu Lys Tyr Asn Asn Asp Trp Trp Ile Gly Arg Leu Val
130 135 140

AAA GAG GGC TGT GAA ATT GGC TTC ATT CCA AGT CCA CTC AGA TTG GAG 542
Lys Glu Gly Cys Glu Ile Gly Phe Ile Pro Ser Pro Leu Arg Leu Glu
145 150 155

AAC ATA CGG ATC CAG CAA GAA CAA AAA AGA GGA CGT TTT CAC GGA GGG 590
Asn Ile Arg Ile Gln Gln Glu Gln Lys Arg Gly Arg Phe His Gly Gly
160 165 170

AAA TCA AGT GGA AAT TCT TCT TCA ACT CTT GGA GAA ATG GTA TCT GGG 638
Lys Ser Ser Gly Asn Ser Ser Ser Ser Leu Gly Glu Met Val Ser Gly
175 180 185 190

ACA TTC CGA GCA ACT CCC ACA TCA ACA GCA AAA CAG AAG CAA AAA GTG 686
Thr Phe Arg Ala Thr Pro Thr Ser Thr Ala Lys Gln Lys Gln Lys Val
195 200 205

ACG GAG CAC ATT CCT CCT TAC GAT GTT GTA CCG TCA ATG CGT CCG GTG 734
Thr Glu His Ile Pro Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val
210 215 220

OTP TTA GTG GGG CCG TCA CTG AAA GGT TAC GAG GTA ACA GAC ATG ATG 782
Val Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met
225 230 235

CAG AAA GCC CTC TTT GAT TCC CTG AAG CAC AGG TTT GAT GGG AGG ATT 830
Gln Lys Ala Leu Phe Asp Ser Leu Lys His Arg Phe Asp Gly Arg Ile
240 245 250

TCA ATA ACG AGA GTG ACA GCT GAC ATT TCT CTT GCT AAG AGG TCT GTC 878
Ser Ile Thr Arg Val Thr Ala Asp Ile Ser Leu Ala Lys Arg Ser Val
255 260 265 270

2 ss ss s s an s - s s « s s ss s s 926
275 280

295 300

S5 5 28: = "s = 5S5CSS:s 

310 315 

a a a a s a s is a a a a s a a a 

325 330 

a a a a a s s - s » « 5 «» « 
aaaaaaaasaaaaaa 

360 365 

a a a a a s ;» a s a a a a 
a a a a a a a a a a a a a a 

390 395 

a a a a a a s a a a a a s s a a 

405 410 



AAA GTC 
Lys Val 
335 

AAG TCA 
Lys Ser 



CTT GCA 
Leu Ala 



CAG CTT 
Gin Leu 



GGA AGG 
Gly Arg 
415 

TCT GGG 
Ser Gly 



AAT TTG GGC TCC ACG GCA CTC TCA CCA TAT CCC ana nn* *™ 
Asn Leu Gly Ser Thr Ala Leu Ser Pro Tyr Pro Thr Ala Ile
420 425 430 

a a a a a s a a a a a a a a 

435 4 40 445 

£™ tP 2?* AGA CGA AGT CTA ATG ACC TCT GAT GAA AAT TAT 
Pro Ile Glu Arg Arg Ser Leu Met Thr Ser Asp Glu Asn Tyr

455 460 

a a a a a a a a a a a a a a 
a a a a a a a a a a a a a a a a 

485 490 

s2 « G ^ ^ T I AC CCC ^ A< 3G AAC CGA GGA TCA CCT GGG 

Ser Tyr Gln Asp Thr Tyr Lys Pro His Arg Asn Arg Gly Ser Pro Gly

500 S05 510 



AAC TCT 
Asn Ser 



CAC AAT 
His Asn 



974 



1022 



1070 



1118 



1166 



1214 



1262 



1310 



1358 



1406 



1454 



1502 



1550 



1598
GGA TAT AGC CAT GAC TCC CGA CAT AGG CTT TGAGTCTAAT GAAACAAAAA 1648
Gly Tyr Ser His Asp Ser Arg His Arg Leu 
515 520 

ATATTCATCT GTTGACAATT TGCCATAGCA GTGCTAGGAT AAACCAATCA TCTTAACTTG 1708 

GCTAACATAG CACAGTATTT ACTGTGCTAA TGGGCTGCTG TCATTTTATG CTAAGTAAGG 1768 

GGCAAAAAAA AAAATTACAT TATGCCCTTG AGTCTAGATG GATATTAGAT GCCCG 1823 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Met Ser Ser Ser Ser Tyr Ala Lys Asn Gly Thr Ala Asp Gly Pro His
1 5 10 15

Ser Pro Thr Ser Gln Val Ala Arg Gly Thr Thr Thr Arg Arg Ser Arg
20 25 30

Leu Lys Arg Ser Asp Gly Ser Thr Thr Ser Thr Ser Phe Ile Leu Arg
35 40 45

Gln Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser Asp Val
50 55 60

Ser Leu Glu Glu Asp Arg Glu Ala Ile Arg Gln Glu Arg Glu Gln Gln
65 70 75 80

Ala Ala Ile Gln Leu Glu Arg Ala Lys Ser Lys Pro Val Ala Phe Ala
85 90 95

Val Lys Thr Asn Val Ser Tyr Cys Gly Ala Leu Asp Glu Asp Val Pro
100 105 110

Val Pro Ser Thr Ala Ile Ser Phe Asp Ala Lys Asp Phe Leu His Ile
115 120 125

Lys Glu Lys Tyr Asn Asn Asp Trp Trp Ile Gly Arg Leu Val Lys Glu
130 135 140

Gly Cys Glu Ile Gly Phe Ile Pro Ser Pro Leu Arg Leu Glu Asn Ile
145 150 155 160

Arg Ile Gln Gln Glu Gln Lys Arg Gly Arg Phe His Gly Gly Lys Ser
165 170 175

Ser Gly Asn Ser Ser Ser Ser Leu Gly Glu Met Val Ser Gly Thr Phe
180 185 190

Arg Ala Thr Pro Thr Ser Thr Ala Lys Gln Lys Gln Lys Val Thr Glu
195 200 205

His Ile Pro Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val Val Leu
210 215 220

Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gln Lys
225 230 235 240

Ala Leu Phe Asp Ser Leu Lys His Arg Phe Asp Gly Arg Ile Ser Ile
245 250 255

Thr Arg Val Thr Ala Asp Ile Ser Leu Ala Lys Arg Ser Val Leu Asn
260 265 270

Asn Pro Ser Lys Arg Ala Ile Ile Glu Arg Ser Asn Thr Arg Ser Ser
275 280 285

Leu Ala Glu Val Gln Ser Glu Ile Glu Arg Ile Phe Glu Leu Ala Arg
290 295 300

Ser Leu Gln Leu Val Val Leu Asp Ala Asp Thr Ile Asn His Pro Ala
305 310 315 320

Gln Leu Ile Lys Thr Ser Leu Ala Pro Ile Ile Val His Val Lys Val
325 330 335

Ser Ser Pro Lys Val Leu Gln Arg Leu Ile Lys Ser Arg Gly Lys Ser
340 345 350

Gln Ser Lys His Leu Asn Val Gln Leu Val Ala Ala Asp Lys Leu Ala
355 360 365

Gln Cys Pro Pro Glu Met Phe Asp Val Ile Leu Asp Glu Asn Gln Leu
370 375 380

Glu Asp Ala Cys Glu His Leu Gly Glu Tyr Leu Glu Ala Tyr Trp Arg
385 390 395 400

Ala Thr His Thr Thr Ser Ser Thr Pro Met Thr Pro Leu Leu Gly Arg
405 410 415

Asn Leu Gly Ser Thr Ala Leu Ser Pro Tyr Pro Thr Ala Ile Ser Gly
420 425 430

Leu Gln Ser Gln Arg Met Arg His Ser Asn His Ser Thr Glu Asn Ser
435 440 445

Pro Ile Glu Arg Arg Ser Leu Met Thr Ser Asp Glu Asn Tyr His Asn
450 455 460

Glu Arg Ala Arg Lys Ser Arg Asn Arg Leu Ser Ser Ser Ser Gln His
465 470 475 480

Ser Arg Asp His Tyr Pro Leu Val Glu Glu Asp Tyr Pro Asp Ser Tyr
485 490 495

Gln Asp Thr Tyr Lys Pro His Arg Asn Arg Gly Ser Pro Gly Gly Tyr
500 505 510

Ser His Asp Ser Arg His Arg Leu 
515 520 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 35..3346

(D) OTHER INFORMATION: /standard_name= "Alpha-2a"

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR

(B) LOCATION: 3347..3636

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 52 

Met Ala Ala Gly Cys Leu 
1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100 
Leu Ala Leu Thr Leu Thr Leu Phe Gln Ser Leu Leu Ile Gly Pro Ser
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 148
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr Ile Lys Ser Trp Val Asp
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 196 
Lys Met Gln Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 244 
Asn Gln Leu Val Asp Ile Tyr Glu Lys Tyr Gln Asp Leu Tyr Thr Val
55 60 65 70

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 292
Glu Pro Asn Asn Ala Arg Gln Leu Val Glu Ile Ala Ala Arg Asp Ile
75 80 85

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 340
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu
90 95 100

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388
Glu Ala Glu Lys Val Gln Ala Ala His Gln Trp Arg Glu Asp Phe Ala
105 110 115

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 436 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 484 
Lys Asn Asp Ser Glu Pro Gly Ser Gln Arg Ile Lys Pro Val Phe Ile
135 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532 
Glu Asp Ala Asn Phe Gly Arg Gln Ile Ser Tyr Gln His Ala Ala Val
155 160 165 

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580 
His Ile Pro Thr Asp Ile Tyr Glu Gly Ser Thr Ile Val Leu Asn Glu
170 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 628
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu
185 190 195 

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676 
Glu Asp Pro Ser Leu Leu Trp Gln Val Phe Gly Ser Ala Thr Gly Leu
200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772 
Asn Lys Ile Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr Ile Gln
235 240 245 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 820 
Gly Ala Ala Ser Pro Lys Asp Met Leu Ile Leu Val Asp Val Ser Gly
250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868 
Ser Val Ser Gly Leu Thr Leu Lys Leu Ile Arg Thr Ser Val Ser Glu
265 270 275

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 916 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 964
Asn Ser Asn Ala Gln Asp Val Ser Cys Phe Gln His Leu Val Gln Ala
295 300 305 310

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn Ile Thr
315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060 
Ala Lys Gly Ile Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108 
Gln Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys Ile Ile
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gln Glu Ile Phe Asn
360 365 370 

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 1204
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly
375 380 385 390 

CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 1252 
Gln His Asn Tyr Glu Arg Gly Pro Ile Gln Trp Met Ala Cys Glu Asn
395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300 
Lys Gly Tyr Tyr Tyr Glu Ile Pro Ser Ile Gly Ala Ile Arg Ile Asn
410 415 420 

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 1348
Thr Gln Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly
425 430 435 

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 1396
Asp Lys Ala Lys Gln Val Gln Trp Thr Asn Val Tyr Leu Asp Ala Leu
440 445 450 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 1444 
Glu Leu Gly Leu Val Ile Thr Gly Thr Leu Pro Val Phe Asn Ile Thr
455 460 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 1492
Gly Gln Phe Glu Asn Lys Thr Asn Leu Lys Asn Gln Leu Ile Leu Gly
475 480 485



GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 1540
Val Met Gly Val Asp Val Ser Leu Glu Asp Ile Lys Arg Leu Thr Pro
490 495 500



CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala Ile Asp Pro Asn
505 510 515



GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG CCT ATT GGT GTA 1636
Gly Tyr Val Leu Leu His Pro Asn Leu Gln Pro Lys Pro Ile Gly Val
520 525 530

GGT ATA CCA ACA ATT AAT TTA AGA AAA AGG AGA CCC AAT ATC CAG AAC 1684
Gly Ile Pro Thr Ile Asn Leu Arg Lys Arg Arg Pro Asn Ile Gln Asn
535 540 545 550 

CCC AAA TCT CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA 1732 
Pro Lys Ser Gln Glu Pro Val Thr Leu Asp Phe Leu Asp Ala Glu Leu
555 560 565 

GAG AAT GAT ATT AAA GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA 1780 
Glu Asn Asp Ile Lys Val Glu Ile Arg Asn Lys Met Ile Asp Gly Glu
570 575 580 

AGT GGA GAA AAA ACA TTC AGA ACT CTG GTT AAA TCT CAA GAT GAG AGA 1828 
Ser Gly Glu Lys Thr Phe Arg Thr Leu Val Lys Ser Gln Asp Glu Arg
585 590 595 

TAT ATT GAC AAA GGA AAC AGG ACA TAC ACA TGG ACA CCT GTC AAT GGC 1876
Tyr Ile Asp Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly
600 605 610 

ACA GAT TAC AGT TTG GCC TTG GTA TTA CCA ACC TAC AGT TTT TAC TAT 1924 
Thr Asp Tyr Ser Leu Ala Leu Val Leu Pro Thr Tyr Ser Phe Tyr Tyr 
615 620 625 630 

ATA AAA GCC AAA CTA GAA GAG ACA ATA ACT CAG GCC AGA TAT TCG GAA 1972 
Ile Lys Ala Lys Leu Glu Glu Thr Ile Thr Gln Ala Arg Tyr Ser Glu
635 640 645 

ACC CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA 2020 
Thr Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe Ile Ala
650 655 660 

CCA AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA 2068
Pro Arg Asp Tyr Cys Asn Asp Leu Lys Ile Ser Asp Asn Asn Thr Glu
665 670 675 

TTT CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC 2116 
Phe Leu Leu Asn Phe Asn Glu Phe Ile Asp Arg Lys Thr Pro Asn Asn
680 685 690

CCA TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC 2164 
Pro Ser Cys Asn Ala Asp Leu Ile Asn Arg Val Leu Leu Asp Ala Gly
695 700 705 710 

TTT ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT ATC 2212 
Phe Thr Asn Glu Leu Val Gln Asn Tyr Trp Ser Lys Gln Lys Asn Ile
715 720 725 

AAG GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA 2260 
Lys Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly Gly Ile Thr Arg
730 735 740 

GTT TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA 2308
Val Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gln Glu Asn Pro Glu Thr
745 750 755

TAT GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT 2356
Tyr Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val 
760 765 770 

TTC ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG 2404 
Phe Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser 
775 780 785 790 

GGC ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT 2452 
Gly Ile Met Val Ser Lys Ala Val Glu Ile Tyr Ile Gln Gly Lys Leu
795 800 805 

CTT AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA 2500 
Leu Lys Pro Ala Val Val Gly Ile Lys Ile Asp Val Asn Ser Trp Ile
810 815 820 

GAG AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT 2548
Glu Asn Phe Thr Lys Thr Ser Ile Arg Asp Pro Cys Ala Gly Pro Val
825 830 835 

TGT GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT 2596 
Cys Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys Val Ile Leu Asp
840 845 850

GAT GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CAG 2644 
Asp Gly Gly Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gln
855 860 865 870

ATT GGA AGA TTT TTT GGA GAG ATT GAT CCC AGC TTG ATG AGA CAC CTG 2692 
Ile Gly Arg Phe Phe Gly Glu Ile Asp Pro Ser Leu Met Arg His Leu
875 880 885

GTT AAT ATA TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA 2740
Val Asn Ile Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gln Ser
890 895 900 

GTA TGT GAG CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA 2788 
Val Cys Glu Pro Gly Ala Ala Pro Lys Gln Gly Ala Gly His Arg Ser
905 910 915 

GCA TAT GTG CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC 2836 
Ala Tyr Val Pro Ser Val Ala Asp Ile Leu Gln Ile Gly Trp Trp Ala
920 925 930 

ACT GCT GCT GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC 2884 
Thr Ala Ala Ala Trp Ser Ile Leu Gln Gln Phe Leu Leu Ser Leu Thr
935 940 945 950 

TTT CCA CGA CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG 2932 
Phe Pro Arg Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr 
955 960 965 

GCC TCC CTG TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC 2980
Ala Ser Leu Ser Lys Gln Ser Cys Ile Thr Glu Gln Thr Gln Tyr Phe
970 975 980 




TTC GAT AAC GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC 3028 
Phe Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn 
985 990 995 

TGT TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA 3076
Cys Ser Arg Ile Phe His Gly Glu Lys Leu Met Asn Thr Asn Leu Ile
1000 1005 1010 

TTC ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG 3124 
Phe Ile Met Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu
1015 1020 1025 1030 

CTC ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG 3172 
Leu Ile Gln Ala Glu Gln Thr Ser Asp Gly Pro Asn Pro Cys Asp Met
1035 1040 1045 

GTT AAG CAA CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC 3220
Val Lys Gln Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn
1050 1055 1060 

AAT GTC TTG GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT 3268 
Asn Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn 
1065 1070 1075 

CCC TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG 3316 
Pro Ser Leu Trp Tyr Ile Ile Gly Ile Gln Phe Leu Leu Leu Trp Leu
1080 1085 1090 

GTA TCT GGC AGC ACA CAC CGG CTG TTA TGACCTTCTA AAAACCAAAT 3363 
Val Ser Gly Ser Thr His Arg Leu Leu 
1095 1100



CTGCATAGTT AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACG 3423

TAGGGTCAGC TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC ATAACACTAA 3483

GGCGCAGACT CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC TTAAACGTGT 3543

GTGAATGCTG CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGTG TCCTCTATTG 3603

GAAAATTTGG GCGTTTGTTG TTGCATTGTT GGT 3636



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 35..3295

(D) OTHER INFORMATION: /standard_name= "Alpha-2c"

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..34 

(ix) FEATURE:

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3296..3585

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 52

Met Ala Ala Gly Cys Leu
1 5

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100
Leu Ala Leu Thr Leu Thr Leu Phe Gln Ser Leu Leu Ile Gly Pro Ser
10 15 20

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 148
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr Ile Lys Ser Trp Val Asp
25 30 35

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 196
Lys Met Gln Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val
40 45 50

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTC 244
Asn Gln Leu Val Asp Ile Tyr Glu Lys Tyr Gln Asp Leu Tyr Thr Val
55 60 65 70

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 292
Glu Pro Asn Asn Ala Arg Gln Leu Val Glu Ile Ala Ala Arg Asp Ile
75 80 85

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTC AGC CTG GCA TTG 340
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu
90 95 100

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388
Glu Ala Glu Lys Val Gln Ala Ala His Gln Trp Arg Glu Asp Phe Ala
105 110 115

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 436
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu
120 125 130

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 484
Lys Asn Asp Ser Glu Pro Gly Ser Gln Arg Ile Lys Pro Val Phe Ile
135 140 145 150

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532
Glu Asp Ala Asn Phe Gly Arg Gln Ile Ser Tyr Gln His Ala Ala Val
155 160 165

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580
His Ile Pro Thr Asp Ile Tyr Glu Gly Ser Thr Ile Val Leu Asn Glu
170 175 180

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 628
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu
185 190 195

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676
Glu Asp Pro Ser Leu Leu Trp Gln Val Phe Gly Ser Ala Thr Gly Leu
200 205 210

724

772

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 
Asn Lys He Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 
235 240 245 

aft ^ c CT £ CT GAC ATG CTT ATT CTG GTG GAT G *G AGT GGA 820 

Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Glv 

250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
265 270 ~ 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn He Thr 
315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 106 0 

Ala Lys Gly He Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108 
Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 
360 365 370 

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 1204 
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 
375 380 385 " 390 



868 



916 



964 



1012 
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CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 1252 

Gin His Asn Tyr Glu Arg Gly Pro lie Gin Trp Met Ala Cys Glu Asn 

395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300
Lys Gly Tyr Tyr Tyr Glu Ile Pro Ser Ile Gly Ala Ile Arg Ile Asn
410 415 420

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 1348
Thr Gln Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly
425 430 435

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 1396
Asp Lys Ala Lys Gln Val Gln Trp Thr Asn Val Tyr Leu Asp Ala Leu
440 445 450

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 1444
Glu Leu Gly Leu Val Ile Thr Gly Thr Leu Pro Val Phe Asn Ile Thr
455 460 465 470

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 1492
Gly Gln Phe Glu Asn Lys Thr Asn Leu Lys Asn Gln Leu Ile Leu Gly
475 480 485

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 1540
Val Met Gly Val Asp Val Ser Leu Glu Asp Ile Lys Arg Leu Thr Pro
490 495 500

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala Ile Asp Pro Asn
505 510 515

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG GAG CCA GTA ACA 1636
Gly Tyr Val Leu Leu His Pro Asn Leu Gln Pro Lys Glu Pro Val Thr
520 525 530

TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT ATT AAA GTG GAG ATT 1684
Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp Ile Lys Val Glu Ile
535 540 545 550

CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA AAA ACA TTC AGA ACT 1732
Arg Asn Lys Met Ile Asp Gly Glu Ser Gly Glu Lys Thr Phe Arg Thr
555 560 565

CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC AAA GGA AAC AGG ACA 1780
Leu Val Lys Ser Gln Asp Glu Arg Tyr Ile Asp Lys Gly Asn Arg Thr
570 575 580

TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC AGT TTG GCC TTG GTA 1828
Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr Ser Leu Ala Leu Val
585 590 595

TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC AAA CTA GAA GAG ACA 1876
Leu Pro Thr Tyr Ser Phe Tyr Tyr Ile Lys Ala Lys Leu Glu Glu Thr
600 605 610

ATA ACT CAG GCC AGA TCA AAA AAG GGC AAA ATG AAG GAT TCG GAA ACC 1924
Ile Thr Gln Ala Arg Ser Lys Lys Gly Lys Met Lys Asp Ser Glu Thr
615 620 625 630

CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA CCA 1972
Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe Ile Ala Pro
635 640 645

AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA TTT 2020
Arg Asp Tyr Cys Asn Asp Leu Lys Ile Ser Asp Asn Asn Thr Glu Phe
650 655 660

CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC CCA 2068
Leu Leu Asn Phe Asn Glu Phe Ile Asp Arg Lys Thr Pro Asn Asn Pro
665 670 675

TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC TTT 2116
Ser Cys Asn Ala Asp Leu Ile Asn Arg Val Leu Leu Asp Ala Gly Phe
680 685 690

ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT ATC AAG 2164
Thr Asn Glu Leu Val Gln Asn Tyr Trp Ser Lys Gln Lys Asn Ile Lys
695 700 705 710

GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA GTT 2212
Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly Gly Ile Thr Arg Val
715 720 725

TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA TAT 2260
Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gln Glu Asn Pro Glu Thr Tyr
730 735 740

GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT TTC 2308
Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe
745 750 755

ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG GGC 2356
Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly
760 765 770

ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT CTT 2404
Ile Met Val Ser Lys Ala Val Glu Ile Tyr Ile Gln Gly Lys Leu Leu
775 780 785 790

AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA GAG 2452
Lys Pro Ala Val Val Gly Ile Lys Ile Asp Val Asn Ser Trp Ile Glu
795 800 805

AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT TGT 2500
Asn Phe Thr Lys Thr Ser Ile Arg Asp Pro Cys Ala Gly Pro Val Cys
810 815 820

GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT GAT 2548
Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys Val Ile Leu Asp Asp
825 830 835

GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CAG ATT 2596
Gly Gly Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gln Ile
840 845 850

GGA AGA TTT TTT GGA GAG ATT GAT CCC AGC TTG ATG AGA CAC CTG GTT 2644
Gly Arg Phe Phe Gly Glu Ile Asp Pro Ser Leu Met Arg His Leu Val
855 860 865 870

AAT ATA TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA GTA 2692
Asn Ile Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gln Ser Val
875 880 885

TGT GAG CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA GCA 2740
Cys Glu Pro Gly Ala Ala Pro Lys Gln Gly Ala Gly His Arg Ser Ala
890 895 900

TAT GTG CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC ACT 2788
Tyr Val Pro Ser Val Ala Asp Ile Leu Gln Ile Gly Trp Trp Ala Thr
905 910 915

GCT GCT GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC TTT 2836
Ala Ala Ala Trp Ser Ile Leu Gln Gln Phe Leu Leu Ser Leu Thr Phe
920 925 930

CCA CGA CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG GCC 2884
Pro Arg Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr Ala
935 940 945 950

TCC CTG TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC TTC 2932
Ser Leu Ser Lys Gln Ser Cys Ile Thr Glu Gln Thr Gln Tyr Phe Phe
955 960 965

GAT AAC GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC TGT 2980
Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn Cys
970 975 980

TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA TTC 3028
Ser Arg Ile Phe His Gly Glu Lys Leu Met Asn Thr Asn Leu Ile Phe
985 990 995

ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG CTC 3076
Ile Met Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu Leu
1000 1005 1010

ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG GTT 3124
Ile Gln Ala Glu Gln Thr Ser Asp Gly Pro Asn Pro Cys Asp Met Val
1015 1020 1025 1030

AAG CAA CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC AAT 3172
Lys Gln Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn Asn
1035 1040 1045

GTC TTG GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT CCC 3220
Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn Pro
1050 1055 1060

TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG GTA 3268
Ser Leu Trp Tyr Ile Ile Gly Ile Gln Phe Leu Leu Leu Trp Leu Val
1065 1070 1075

TCT GGC AGC ACA CAC CGG CTG TTA TGACCTTCTA AAAACCAAAT CTGCATAGTT 3322
Ser Gly Ser Thr His Arg Leu Leu
1080 1085

AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACG TAGGGTCAGC 3382

TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC ATAACACTAA GGCGCAGACT 3442

CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC TTAAACGTGT GTGAATGCTG 3502

CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGTG TCCTCTATTG GAAAATTTGG 3562

GCGTTTGTTG TTGCATTGTT GGT 3585

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS
(B) LOCATION: 35..3374 (Δ1625 to 1639 & Δ1908 to 1928)
(D) OTHER INFORMATION: /standard_name= "Alpha-2d"

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3375..3565

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 52
Met Ala Ala Gly Cys Leu
1 5

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100
Leu Ala Leu Thr Leu Thr Leu Phe Gln Ser Leu Leu Ile Gly Pro Ser
10 15 20

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 148
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr Ile Lys Ser Trp Val Asp
25 30 35

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 196
Lys Met Gln Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val
40 45 50

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 244
Asn Gln Leu Val Asp Ile Tyr Glu Lys Tyr Gln Asp Leu Tyr Thr Val
55 60 65 70

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 292
Glu Pro Asn Asn Ala Arg Gln Leu Val Glu Ile Ala Ala Arg Asp Ile
75 80 85

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 340
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu
90 95 100

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388
Glu Ala Glu Lys Val Gln Ala Ala His Gln Trp Arg Glu Asp Phe Ala
105 110 115

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 436
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu
120 125 130

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 484
Lys Asn Asp Ser Glu Pro Gly Ser Gln Arg Ile Lys Pro Val Phe Ile
135 140 145 150

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532
Glu Asp Ala Asn Phe Gly Arg Gln Ile Ser Tyr Gln His Ala Ala Val
155 160 165

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580
His Ile Pro Thr Asp Ile Tyr Glu Gly Ser Thr Ile Val Leu Asn Glu
170 175 180

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 628
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu
185 190 195

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676
Glu Asp Pro Ser Leu Leu Trp Gln Val Phe Gly Ser Ala Thr Gly Leu
200 205 210

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro
215 220 225 230

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772
Asn Lys Ile Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr Ile Gln
235 240 245

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 820
Gly Ala Ala Ser Pro Lys Asp Met Leu Ile Leu Val Asp Val Ser Gly
250 255 260

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868
Ser Val Ser Gly Leu Thr Leu Lys Leu Ile Arg Thr Ser Val Ser Glu
265 270 275

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 916
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe
280 285 290

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 964
Asn Ser Asn Ala Gln Asp Val Ser Cys Phe Gln His Leu Val Gln Ala
295 300 305 310

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn Ile Thr
315 320 325

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060
Ala Lys Gly Ile Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu
330 335 340

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108
Gln Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys Ile Ile
345 350 355

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gln Glu Ile Phe Asn
360 365 370

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 1204
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly
375 380 385 390

CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 1252
Gln His Asn Tyr Glu Arg Gly Pro Ile Gln Trp Met Ala Cys Glu Asn
395 400 405

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300
Lys Gly Tyr Tyr Tyr Glu Ile Pro Ser Ile Gly Ala Ile Arg Ile Asn
410 415 420

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 1348
Thr Gln Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly
425 430 435

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 1396
Asp Lys Ala Lys Gln Val Gln Trp Thr Asn Val Tyr Leu Asp Ala Leu
440 445 450

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 1444
Glu Leu Gly Leu Val Ile Thr Gly Thr Leu Pro Val Phe Asn Ile Thr
455 460 465 470

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 1492
Gly Gln Phe Glu Asn Lys Thr Asn Leu Lys Asn Gln Leu Ile Leu Gly
475 480 485

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 1540
Val Met Gly Val Asp Val Ser Leu Glu Asp Ile Lys Arg Leu Thr Pro
490 495 500

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala Ile Asp Pro Asn
505 510 515

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG GAG CCA GTA ACA 1636
Gly Tyr Val Leu Leu His Pro Asn Leu Gln Pro Lys Glu Pro Val Thr
520 525 530

TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT ATT AAA GTG GAG ATT 1684
Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp Ile Lys Val Glu Ile
535 540 545 550

CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA AAA ACA TTC AGA ACT 1732
Arg Asn Lys Met Ile Asp Gly Glu Ser Gly Glu Lys Thr Phe Arg Thr
555 560 565

CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC AAA GGA AAC AGG ACA 1780
Leu Val Lys Ser Gln Asp Glu Arg Tyr Ile Asp Lys Gly Asn Arg Thr
570 575 580

TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC AGT TTG GCC TTG GTA 1828
Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr Ser Leu Ala Leu Val
585 590 595

TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC AAA CTA GAA GAG ACA 1876
Leu Pro Thr Tyr Ser Phe Tyr Tyr Ile Lys Ala Lys Leu Glu Glu Thr
600 605 610

ATA ACT CAG GCC AGA TAT TCG GAA ACC CTG AAG CCA GAT AAT TTT GAA 1924
Ile Thr Gln Ala Arg Tyr Ser Glu Thr Leu Lys Pro Asp Asn Phe Glu
615 620 625 630

GAA TCT GGC TAT ACA TTC ATA GCA CCA AGA GAT TAC TGC AAT GAC CTG 1972
Glu Ser Gly Tyr Thr Phe Ile Ala Pro Arg Asp Tyr Cys Asn Asp Leu
635 640 645

AAA ATA TCG GAT AAT AAC ACT GAA TTT CTT TTA AAT TTC AAC GAG TTT 2020
Lys Ile Ser Asp Asn Asn Thr Glu Phe Leu Leu Asn Phe Asn Glu Phe
650 655 660

ATT GAT AGA AAA ACT CCA AAC AAC CCA TCA TGT AAC GCG GAT TTG ATT 2068
Ile Asp Arg Lys Thr Pro Asn Asn Pro Ser Cys Asn Ala Asp Leu Ile
665 670 675

AAT AGA GTC TTG CTT GAT GCA GGC TTT ACA AAT GAA CTT GTC CAA AAT 2116
Asn Arg Val Leu Leu Asp Ala Gly Phe Thr Asn Glu Leu Val Gln Asn
680 685 690

TAC TGG AGT AAG CAG AAA AAT ATC AAG GGA GTG AAA GCA CGA TTT GTT 2164
Tyr Trp Ser Lys Gln Lys Asn Ile Lys Gly Val Lys Ala Arg Phe Val
695 700 705 710

GTG ACT GAT GGT GGG ATT ACC AGA GTT TAT CCC AAA GAG GCT GGA GAA 2212
Val Thr Asp Gly Gly Ile Thr Arg Val Tyr Pro Lys Glu Ala Gly Glu
715 720 725

AAT TGG CAA GAA AAC CCA GAG ACA TAT GAG GAC AGC TTC TAT AAA AGG 2260
Asn Trp Gln Glu Asn Pro Glu Thr Tyr Glu Asp Ser Phe Tyr Lys Arg
730 735 740

AGC CTA GAT AAT GAT AAC TAT GTT TTC ACT GCT CCC TAC TTT AAC AAA 2308
Ser Leu Asp Asn Asp Asn Tyr Val Phe Thr Ala Pro Tyr Phe Asn Lys
745 750 755

AGT GGA CCT GGT GCC TAT GAA TCG GGC ATT ATG GTA AGC AAA GCT GTA 2356
Ser Gly Pro Gly Ala Tyr Glu Ser Gly Ile Met Val Ser Lys Ala Val
760 765 770

GAA ATA TAT ATT CAA GGG AAA CTT CTT AAA CCT GCA GTT GTT GGA ATT 2404
Glu Ile Tyr Ile Gln Gly Lys Leu Leu Lys Pro Ala Val Val Gly Ile
775 780 785 790

AAA ATT GAT GTA AAT TCC TGG ATA GAG AAT TTC ACC AAA ACC TCA ATC 2452
Lys Ile Asp Val Asn Ser Trp Ile Glu Asn Phe Thr Lys Thr Ser Ile
795 800 805

AGA GAT CCG TGT GCT GGT CCA GTT TGT GAC TGC AAA AGA AAC AGT GAC 2500
Arg Asp Pro Cys Ala Gly Pro Val Cys Asp Cys Lys Arg Asn Ser Asp
810 815 820

GTA ATG GAT TGT GTG ATT CTG GAT GAT GGT GGG TTT CTT CTG ATG GCA 2548
Val Met Asp Cys Val Ile Leu Asp Asp Gly Gly Phe Leu Leu Met Ala
825 830 835

AAT CAT GAT GAT TAT ACT AAT CAG ATT GGA AGA TTT TTT GGA GAG ATT 2596
Asn His Asp Asp Tyr Thr Asn Gln Ile Gly Arg Phe Phe Gly Glu Ile
840 845 850

GAT CCC AGC TTG ATG AGA CAC CTG GTT AAT ATA TCA GTT TAT GCT TTT 2644
Asp Pro Ser Leu Met Arg His Leu Val Asn Ile Ser Val Tyr Ala Phe
855 860 865 870

AAC AAA TCT TAT GAT TAT CAG TCA GTA TGT GAG CCC GGT GCT GCA CCA 2692
Asn Lys Ser Tyr Asp Tyr Gln Ser Val Cys Glu Pro Gly Ala Ala Pro
875 880 885

AAA CAA GGA GCA GGA CAT CGC TCA GCA TAT GTG CCA TCA GTA GCA GAC 2740
Lys Gln Gly Ala Gly His Arg Ser Ala Tyr Val Pro Ser Val Ala Asp
890 895 900

ATA TTA CAA ATT GGC TGG TGG GCC ACT GCT GCT GCC TGG TCT ATT CTA 2788
Ile Leu Gln Ile Gly Trp Trp Ala Thr Ala Ala Ala Trp Ser Ile Leu
905 910 915

CAG CAG TTT CTC TTG AGT TTG ACC TTT CCA CGA CTC CTT GAG GCA GTT 2836
Gln Gln Phe Leu Leu Ser Leu Thr Phe Pro Arg Leu Leu Glu Ala Val
920 925 930

GAG ATG GAG GAT GAT GAC TTC ACG GCC TCC CTG TCC AAG CAG AGC TGC 2884
Glu Met Glu Asp Asp Asp Phe Thr Ala Ser Leu Ser Lys Gln Ser Cys
935 940 945 950

ATT ACT GAA CAA ACC CAG TAT TTC TTC GAT AAC GAC AGT AAA TCA TTC 2932
Ile Thr Glu Gln Thr Gln Tyr Phe Phe Asp Asn Asp Ser Lys Ser Phe
955 960 965

AGT GGT GTA TTA GAC TGT GGA AAC TGT TCC AGA ATC TTT CAT GGA GAA 2980
Ser Gly Val Leu Asp Cys Gly Asn Cys Ser Arg Ile Phe His Gly Glu
970 975 980

AAG CTT ATG AAC ACC AAC TTA ATA TTC ATA ATG GTT GAG AGC AAA GGG 3028
Lys Leu Met Asn Thr Asn Leu Ile Phe Ile Met Val Glu Ser Lys Gly
985 990 995

ACA TGT CCA TGT GAC ACA CGA CTG CTC ATA CAA GCG GAG CAG ACT TCT 3076
Thr Cys Pro Cys Asp Thr Arg Leu Leu Ile Gln Ala Glu Gln Thr Ser
1000 1005 1010

GAC GGT CCA AAT CCT TGT GAC ATG GTT AAG CAA CCT AGA TAC CGA AAA 3124
Asp Gly Pro Asn Pro Cys Asp Met Val Lys Gln Pro Arg Tyr Arg Lys
1015 1020 1025 1030

GGG CCT GAT GTC TGC TTT GAT AAC AAT GTC TTG GAG GAT TAT ACT GAC 3172
Gly Pro Asp Val Cys Phe Asp Asn Asn Val Leu Glu Asp Tyr Thr Asp
1035 1040 1045

TGT GGT GGT GTT TCT GGA TTA AAT CCC TCC CTG TGG TAT ATC ATT GGA 3220
Cys Gly Gly Val Ser Gly Leu Asn Pro Ser Leu Trp Tyr Ile Ile Gly
1050 1055 1060

ATC CAG TTT CTA CTA CTT TGG CTG GTA TCT GGC AGC ACA CAC CGG CTG 3268
Ile Gln Phe Leu Leu Leu Trp Leu Val Ser Gly Ser Thr His Arg Leu
1065 1070 1075

TTA TGACCTTCTA AAAACCAAAT CTGCATAGTT AAACTCCAGA CCCTGCCAAA 3321
Leu

ACATGAGCCC TGCCCTCAAT TACAGTAACG TAGGGTCAGC TATAAAATCA GACAAACATT 3381
AGCTGGGCCT GTTCCATGGC ATAACACTAA GGCGCAGACT CCTAAGGCAC CCACTGGCTG 3441
CATGTCAGGG TGTCAGATCC TTAAACGTGT GTGAATGCTG CATCATCTAT GTGTAACATC 3501
AAAGCAAAAT CCTATACGTG TCCTCTATTG GAAAATTTGG GCGTTTGTTG TTGCATTGTT 3561

GGT 3564

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3579 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 35..3289

(D) OTHER INFORMATION: /standard_name= "Alpha-2e"

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..34

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 3289..3579



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 52
Met Ala Ala Gly Cys Leu
1 5

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100
Leu Ala Leu Thr Leu Thr Leu Phe Gln Ser Leu Leu Ile Gly Pro Ser
10 15 20

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 148
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr Ile Lys Ser Trp Val Asp
25 30 35

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 196
Lys Met Gln Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val
40 45 50

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 244
Asn Gln Leu Val Asp Ile Tyr Glu Lys Tyr Gln Asp Leu Tyr Thr Val
55 60 65 70

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 292
Glu Pro Asn Asn Ala Arg Gln Leu Val Glu Ile Ala Ala Arg Asp Ile
75 80 85

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 340
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu
90 95 100

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388
Glu Ala Glu Lys Val Gln Ala Ala His Gln Trp Arg Glu Asp Phe Ala
105 110 115

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 436
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu
120 125 130

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 484
Lys Asn Asp Ser Glu Pro Gly Ser Gln Arg Ile Lys Pro Val Phe Ile
135 140 145 150

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532
Glu Asp Ala Asn Phe Gly Arg Gln Ile Ser Tyr Gln His Ala Ala Val
155 160 165

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580
His Ile Pro Thr Asp Ile Tyr Glu Gly Ser Thr Ile Val Leu Asn Glu
170 175 180

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 628
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu
185 190 195

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676
Glu Asp Pro Ser Leu Leu Trp Gln Val Phe Gly Ser Ala Thr Gly Leu
200 205 210

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro
215 220 225 230

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772
Asn Lys Ile Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr Ile Gln
235 240 245

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 820
Gly Ala Ala Ser Pro Lys Asp Met Leu Ile Leu Val Asp Val Ser Gly
250 255 260

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868
Ser Val Ser Gly Leu Thr Leu Lys Leu Ile Arg Thr Ser Val Ser Glu
265 270 275

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 916
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe
280 285 290

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 964
Asn Ser Asn Ala Gln Asp Val Ser Cys Phe Gln His Leu Val Gln Ala
295 300 305 310

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn Ile Thr
315 320 325

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060
Ala Lys Gly Ile Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu
330 335 340

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108
Gln Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys Ile Ile
345 350 355

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gln Glu Ile Phe Asn
360 365 370

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 1204
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly
375 380 385 390

CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 1252
Gln His Asn Tyr Glu Arg Gly Pro Ile Gln Trp Met Ala Cys Glu Asn
395 400 405

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300
Lys Gly Tyr Tyr Tyr Glu Ile Pro Ser Ile Gly Ala Ile Arg Ile Asn
410 415 420

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 1348
Thr Gln Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly
425 430 435

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 1396
Asp Lys Ala Lys Gln Val Gln Trp Thr Asn Val Tyr Leu Asp Ala Leu
440 445 450

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 1444
Glu Leu Gly Leu Val Ile Thr Gly Thr Leu Pro Val Phe Asn Ile Thr
455 460 465 470

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 1492
Gly Gln Phe Glu Asn Lys Thr Asn Leu Lys Asn Gln Leu Ile Leu Gly
475 480 485

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 1540
Val Met Gly Val Asp Val Ser Leu Glu Asp Ile Lys Arg Leu Thr Pro
490 495 500

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala Ile Asp Pro Asn
505 510 515

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG AAC CCC AAA TCT 1636
Gly Tyr Val Leu Leu His Pro Asn Leu Gln Pro Lys Asn Pro Lys Ser
520 525 530

CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT 1684
Gln Glu Pro Val Thr Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp
535 540 545 550

ATT AAA GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA 1732
Ile Lys Val Glu Ile Arg Asn Lys Met Ile Asp Gly Glu Ser Gly Glu
555 560 565

AAA ACA TTC AGA ACT CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC 1780
Lys Thr Phe Arg Thr Leu Val Lys Ser Gln Asp Glu Arg Tyr Ile Asp
570 575 580

AAA GGA AAC AGG ACA TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC 1828
Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr
585 590 595

AGT TTG GCC TTG GTA TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC 1876
Ser Leu Ala Leu Val Leu Pro Thr Tyr Ser Phe Tyr Tyr Ile Lys Ala
600 605 610

AAA CTA GAA GAG ACA ATA ACT CAG GCC AGA TAT TCG GAA ACC CTG AAG 1924
Lys Leu Glu Glu Thr Ile Thr Gln Ala Arg Tyr Ser Glu Thr Leu Lys
615 620 625 630

CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA CCA AGA GAT 1972
Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe Ile Ala Pro Arg Asp
635 640 645

TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA TTT CTT TTA 2020
Tyr Cys Asn Asp Leu Lys Ile Ser Asp Asn Asn Thr Glu Phe Leu Leu
650 655 660

AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC CCA TCA TGT 2068
Asn Phe Asn Glu Phe Ile Asp Arg Lys Thr Pro Asn Asn Pro Ser Cys
665 670 675

AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC TTT ACA AAT 2116
Asn Ala Asp Leu Ile Asn Arg Val Leu Leu Asp Ala Gly Phe Thr Asn
680 685 690

GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT ATC AAG GGA GTG 2164
Glu Leu Val Gln Asn Tyr Trp Ser Lys Gln Lys Asn Ile Lys Gly Val
695 700 705 710

AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA GTT TAT CCC 2212
Lys Ala Arg Phe Val Val Thr Asp Gly Gly Ile Thr Arg Val Tyr Pro
715 720 725

AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA TAT GAG GAC 2260
Lys Glu Ala Gly Glu Asn Trp Gln Glu Asn Pro Glu Thr Tyr Glu Asp
730 735 740

AGC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT TTC ACT GCT 2308
Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe Thr Ala
745 750 755

CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG GGC ATT ATG 2356
Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly Ile Met
760 765 770

GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT CTT AAA CCT 2404
Val Ser Lys Ala Val Glu Ile Tyr Ile Gln Gly Lys Leu Leu Lys Pro
775 780 785 790

GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA GAG AAT TTC 2452
Ala Val Val Gly Ile Lys Ile Asp Val Asn Ser Trp Ile Glu Asn Phe
795 800 805

ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT TGT GAC TGC 2500
Thr Lys Thr Ser Ile Arg Asp Pro Cys Ala Gly Pro Val Cys Asp Cys
810 815 820

AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT GAT GGT GGG 2548
Lys Arg Asn Ser Asp Val Met Asp Cys Val Ile Leu Asp Asp Gly Gly
825 830 835

TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CAG ATT GGA AGA 2596 
Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gin He Gly Ara 
8 *0 845 * 850 

T TT TTT GGA GAG ATT GAT CCC AGC TTG ATG AGA CAC CTG GTT AAT ATA 2644 
Phe Phe Gly Glu He Asp Pro Ser Leu Met Arg His Leu Val Asn He 
855 860 865 870 

TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA GTA TGT GAG 2692 
Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser Val Cys Glu 
875 880 885 

CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA GCA TAT GTG 2 74 0 

Pro Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arg Ser Ala Tyr Val 
890 895 " 900 

CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC ACT GCT GCT 2788 
Pro Ser Val Ala Asp He Leu Gin He Gly Trp Trp Ala Thr Ala Ala 
905 9io 915 

GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC TTT CCA CGA 2836 
Ala Trp Ser lie Leu Gin Gin Phe Leu Leu Ser Leu Thr Phe Pro Ara 
920 925 930 

CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACQ GCC TCC CTG 2884 
Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr Ala Ser Leu 
935 940 945 950 

TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC TTC GAT AAC 2932 
Ser Lys Gln Ser Cys Ile Thr Glu Gln Thr Gln Tyr Phe Phe Asp Asn 
955 960 965 

GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC TGT TCC AGA 2980 
Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn Cys Ser Arg 
970 975 980 

ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA TTC ATA ATG 3028
Ile Phe His Gly Glu Lys Leu Met Asn Thr Asn Leu Ile Phe Ile Met 
985 990 995 

GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG CTC ATA CAA 3076
Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu Leu Ile Gln 
1000 1005 1010 

GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG GTT AAG CAA 3124 
Ala Glu Gln Thr Ser Asp Gly Pro Asn Pro Cys Asp Met Val Lys Gln 

1020 1025 1030 

CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC AAT GTC TTG 3172 
Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn Asn Val Leu 
1035 1040 1045 

GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT CCC TCC CTG 3220
Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn Pro Ser Leu 
1050 1055 1060 

TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG GTA TCT GGC 3268 
Trp Tyr Ile Ile Gly Ile Gln Phe Leu Leu Leu Trp Leu Val Ser Gly 
1065 1070 1075 

AGC ACA CAC CGG CTG TTA TGACCTTCTA AAAACCAAAT CTGCATAGTT 3316 
Ser Thr His Arg Leu Leu 

1080 108 

AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACG TAGGGTCAGC 3376 

TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC ATAACACTAA GGCGCAGACT 3436 

CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC TTAAACGTGT GTGAATGCTG 3496 

CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGTG TCCTCTATTG GAAAATTTGG 3556 

GCGTTTGTTG TTGCATTGTT GGT 3579 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1437 

(D) OTHER INFORMATION: /standard_name= "Beta-1-1" 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1435..1681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 48
Met Val Gln Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gln 
1 5 10 15 

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 96 
Glu Ile Pro Met Glu Val Phe Asp Pro Ser Pro Gln Gly Lys Tyr Ser 
20 25 30 

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 144 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 192 
Thr Thr Ser Asn Ser Phe Val Arg Gln Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 240
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 80 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 288 
Leu Arg Lys Glu Ala Glu Arg Gln Ala Leu Ala Gln Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 336 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 110 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384 
Pro Ser Pro Gly Asp Glu Val Pro Val Gln Gly Val Ala Ile Thr Phe 
115 120 125 

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432 
Glu Pro Lys Asp Phe Leu His Ile Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480 
Trp Ile Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe Ile Pro 
145 150 155 160 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gln Glu Gln Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gln Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GGT AAT GAA ATG ACT AAC TTA GCC TTT GAA CTA GAC CCC CTA GAG 672 
Ser Gly Asn Glu Met Thr Asn Leu Ala Phe Glu Leu Asp Pro Leu Glu 
210 215 220 

TTA GAG GAG GAA GAG GCT GAG CTT GGT GAG CAG AGT GGC TCT GCC AAG 720 
Leu Glu Glu Glu Glu Ala Glu Leu Gly Glu Gln Ser Gly Ser Ala Lys 
225 230 235 240 

ACT AGT GTT AGC AGT GTC ACC ACC CCG CCA CCC CAT GGC AAA CGC ATC 768
Thr Ser Val Ser Ser Val Thr Thr Pro Pro Pro His Gly Lys Arg Ile 
245 250 255 

CCC TTC TTT AAG AAG ACA GAG CAT GTG CCC CCC TAT GAC GTG GTG CCT 816 
Pro Phe Phe Lys Lys Thr Glu His Val Pro Pro Tyr Asp Val Val Pro 
260 265 270 

TCC ATG AGG CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG GGC TAC GAG 864 
Ser Met Arg Pro Ile Ile Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu 
275 280 285 

GTT ACA GAC ATG ATG CAG AAA GCT TTA TTT GAC TTC TTG AAG CAT CGG 912 
Val Thr Asp Met Met Gln Lys Ala Leu Phe Asp Phe Leu Lys His Arg 
290 295 300 

TTT GAT GGC AGG ATC TCC ATC ACT CGT GTG ACG GCA GAT ATT TCC CTG 960 
Phe Asp Gly Arg Ile Ser Ile Thr Arg Val Thr Ala Asp Ile Ser Leu 
305 310 315 320 

GCT AAG CGC TCA GTT CTC AAC AAC CCC AGC AAA CAC ATC ATC ATT GAG 1008 
Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His Ile Ile Ile Glu 
325 330 335 

CGC TCC AAC ACA CGC TCC AGC CTG GCT GAG GTG CAG AGT GAA ATC GAG 1056 
Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gln Ser Glu Ile Glu 
340 345 350 

CGA ATC TTC GAG CTG GCC CGG ACC CTT CAG TTG GTC GCT CTG GAT GCT 1104 
Arg Ile Phe Glu Leu Ala Arg Thr Leu Gln Leu Val Ala Leu Asp Ala 
355 360 365 

GAC ACC ATC AAT CAC CCA GCC CAG CTG TCC AAG ACC TCG CTG GCC CCC 1152 
Asp Thr Ile Asn His Pro Ala Gln Leu Ser Lys Thr Ser Leu Ala Pro 
370 375 380 

ATC ATT GTT TAC ATC AAG ATC ACC TCT CCC AAG GTA CTT CAA AGG CTC 1200 
Ile Ile Val Tyr Ile Lys Ile Thr Ser Pro Lys Val Leu Gln Arg Leu 
385 390 395 400 

ATC AAG TCC CGA GGA AAG TCT CAG TCC AAA CAC CTC AAT GTC CAA ATA 1248
Ile Lys Ser Arg Gly Lys Ser Gln Ser Lys His Leu Asn Val Gln Ile 
405 410 415 

GCG GCC TCG GAA AAG CTG GCA CAG TGC CCC CCT GAA ATG TTT GAC ATC 1296 
Ala Ala Ser Glu Lys Leu Ala Gln Cys Pro Pro Glu Met Phe Asp Ile 
420 425 430 

ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT CTG GCG GAG 1344 
Ile Leu Asp Glu Asn Gln Leu Glu Asp Ala Cys Glu His Leu Ala Glu 
435 440 445 

TAC TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGC AGC ACG CCA 1392 

Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser Ser Thr Pro 
450 455 460 

CCC AAT CCG CTG CTG AAC CGC ACC ATG GCT ACC GCA GCC CTG GCT 1437 
Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala Leu Ala 
465 470 475 

GCCAGCCCTG CCCCTGTCTC CAACCTCCAG GTACAGGTGC TCACCTCGCT CAGGAGAAAC 1497 

CTCGGCTTCT GGGGCGGGCT GGAGTCCTCA CAGCGGGGCA GTGTGGTGCC CCAGGAGCAG 1557 

GAACATGCCA TGTAGTGGGC GCCCTGCCCG TCTTCCCTCC TGCTCTGGGG TCGGAACTGG 1617 

AGTGCAGGGA ACATGGAGGA GGAAGGGAAG AGCTTTATTT TGTAAAAAAA TAAGATGAGC 1677 

GGCA 1681 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..651 

(D) OTHER INFORMATION: /standard_name= "Beta-1-4" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 48
Met Val Gln Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gln 
1 5 10 15 

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 96 
Glu Ile Pro Met Glu Val Phe Asp Pro Ser Pro Gln Gly Lys Tyr Ser 
20 25 30 

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 144 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 192 
Thr Thr Ser Asn Ser Phe Val Arg Gln Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 240
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 80 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 288 
Leu Arg Lys Glu Ala Glu Arg Gln Ala Leu Ala Gln Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 336
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 110 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384
Pro Ser Pro Gly Asp Glu Val Pro Val Gln Gly Val Ala Ile Thr Phe 
115 120 125 
GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432 
Glu Pro Lys Asp Phe Leu His Ile Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480 
Trp Ile Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe Ile Pro 
145 150 155 160 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gln Glu Gln Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gln Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GAC AGA GCA TGT GCC CCC CTA TGACGTGGTG CCTTCCATGA GGCCCATCAT 678 
Ser Asp Arg Ala Cys Ala Pro Leu 
210 215 



CCTGGTGGGA CCGTCGCTCA AGGGCTACGA GGTTACAGAC ATGATGCAGA AAGCTTTATT 738 

TGACTTCTTG AAGCATCGGT TTGATGGCAG GATCTCCATC ACTCGTGTGA CGGCAGATAT 798 

TTCCCTGGCT AAGCGCTCAG TTCTCAACAA CCCCAGCAAA CACATCATCA TTGAGCGCTC 858 

CAACACACGC TCCAGCCTGG CTGAGGTGCA GAGTGAAATC GAGCGAATCT TCGAGCTGGC 918 

CCGGACCCTT CAGTTGGTCG CTCTGGATGC TGACACCATC AATCACCCAG CCCAGCTGTC 978 

CAAGACCTCG CTGGCCCCCA TCATTGTTTA CATCAAGATC ACCTCTCCCA AGGTACTTCA 1038 

AAGGCTCATC AAGTCCCGAG GAAAGTCTCA GTCCAAACAC CTCAATGTCC AAATAGCGGC 1098 

CTCGGAAAAG CTGGCACAGT GCCCCCCTGA AATGTTTGAC ATCATCCTGG ATGAGAACCA 1158 

ATTGGAGGAT GCCTGCGAGC ATCTGGCGGA GTACTTGGAA GCCTATTGGA AGGCCACACA 1218 

CCCGCCCAGC AGCACGCCAC CCAATCCGCT GCTGAACCGC ACCATGGCTA CCGCAGCCCT 1278 

GGCTGCCAGC CCTGCCCCTG TCTCCAACCT CCAGGTACAG GTGCTCACCT CGCTCAGGAG 1338 

AAACCTCGGC TTCTGGGGCG GGCTGGAGTC CTCACAGCGG GGCAGTGTGG TGCCCCAGGA 1398 

GCAGGAACAT GCCATGTAGT GGGCGCCCTG CCCGTCTTCC CTCCTGCTCT GGGGTCGGAA 1458 

CTGGAGTGCA GGGAACATGG AGGAGGAAGG GAAGAGCTTT ATTTTGTAAA AAAATAAGAT 1518 

GAGCGGCA 1526 



(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..660 

(D) OTHER INFORMATION: /standard_name= "Beta-1-5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 48 
Met Val Gln Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gln 
1 5 10 15 

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 96 
Glu Ile Pro Met Glu Val Phe Asp Pro Ser Pro Gln Gly Lys Tyr Ser 
20 25 30 

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 144 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 192 
Thr Thr Ser Asn Ser Phe Val Arg Gln Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 240 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 80 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 288 
Leu Arg Lys Glu Ala Glu Arg Gln Ala Leu Ala Gln Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 336 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 110 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384 
Pro Ser Pro Gly Asp Glu Val Pro Val Gln Gly Val Ala Ile Thr Phe 
115 120 125 

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432 
Glu Pro Lys Asp Phe Leu His Ile Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480 
Trp Ile Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe Ile Pro 
145 150 155 160 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gln Glu Gln Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gln Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GGT TAC AGA CAT GAT GCA GAA AGC TTT ATT TGACTTCTTG AAGCATCGGT 677 
Ser Gly Tyr Arg His Asp Ala Glu Ser Phe Ile 

210 215 220 

TTGATGGCAG GATCTCCATC ACTCGTGTGA CGGCAGATAT TTCCCTGGCT AAGCGCTCAG 737 

TTCTCAACAA CCCCAGCAAA CACATCATCA TTGAGCGCTC CAACACACGC TCCAGCCTGG 797 

CTGAGGTGCA GAGTGAAATC GAGCGAATCT TCGAGCTGGC CCGGACCCTT CAGTTGGTCG 857 

CTCTGGATGC TGACACCATC AATCACCCAG CCCAGCTGTC CAAGACCTCG CTGGCCCCCA 917 

TCATTGTTTA CATCAAGATC ACCTCTCCCA AGGTACTTCA AAGGCTCATC AAGTCCCGAG 977 

GAAAGTCTCA GTCCAAACAC CTCAATGTCC AAATAGCGGC CTCGGAAAAG CTGGCACAGT 1037 

GCCCCCCTGA AATGTTTGAC ATCATCCTGG ATGAGAACCA ATTGGAGGAT GCCTGCGAGC 1097 

ATCTGGCGGA GTACTTGGAA GCCTATTGGA AGGCCACACA CCCGCCCAGC AGCACGCCAC 1157 

CCAATCCGCT GCTGAACCGC ACCATGGCTA CCGCAGCCCT GGCTGCCAGC CCTGCCCCTG 1217 

TCTCCAACCT CCAGGTACAG GTGCTCACCT CGCTCAGGAG AAACCTCGGC TTCTGGGGCG 1277 

GGCTGGAGTC CTCACAGCGG GGCAGTGTGG TGCCCCAGGA GCAGGAACAT GCCATGTAGT 1337 

GGGCGCCCTG CCCGTCTTCC CTCCTGCTCT GGGGTCGGAA CTGGAGTGCA GGGAAC 1393 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 226..6642 

(D) OTHER INFORMATION: /standard_name= "Alpha-1C-^ 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CTCGAGGAGG CAGTAGTGGA AAGGAGCAGT TTTTGGGGTT TGATGCCATA ATGGGAATCA 60 

GGTAATCGTC GGCGGGGAAG AAGAAACGCT GCAGACCACG GCTTCCTCGA ATCTTGCGCG 120 

AAAGCCGCCG GCCTCGGAGG AGGGATTAAT CCAGACCCGC CGGGGGGTGT TTTCACATTT 180 

CTTCCTCTTC GTGGCTGCTC CTCCTATTAA AACCATTTTT GGTCC ATG GTC AAT 234 
Met Val Asn 
1 

GAG AAT ACG AGG ATG TAC ATT CCA GAG GAA AAC CAC CAA GGT TCC AAC 282 
Glu Asn Thr Arg Met Tyr Ile Pro Glu Glu Asn His Gln Gly Ser Asn 
5 10 15 

TAT GGG AGC CCA CGC CCC GCC CAT GCC AAC ATG AAT GCC AAT GCG GCA 330 
Tyr Gly Ser Pro Arg Pro Ala His Ala Asn Met Asn Ala Asn Ala Ala 
20 25 30 35 

GCG GGG CTG GCC CCT GAG CAC ATC CCC ACC CCG GGG GCT GCC CTG TCG 378 
Ala Gly Leu Ala Pro Glu His Ile Pro Thr Pro Gly Ala Ala Leu Ser 
40 45 50 

TGG CAG GCG GCC ATC GAC GCA GCC CGG CAG GCT AAG CTG ATG GGC AGC 426 
Trp Gln Ala Ala Ile Asp Ala Ala Arg Gln Ala Lys Leu Met Gly Ser 
55 60 65 

GCT GGC AAT GCG ACC ATC TCC ACA GTC AGC TCC ACG CAG CGG AAG CGG 474 
Ala Gly Asn Ala Thr Ile Ser Thr Val Ser Ser Thr Gln Arg Lys Arg 
70 75 80 

CAG CAA TAT GGG AAA CCC AAG AAG CAG GGC AGC ACC ACG GCC ACA CGC 522 
Gln Gln Tyr Gly Lys Pro Lys Lys Gln Gly Ser Thr Thr Ala Thr Arg 
85 90 95 

CCG CCC CGA GCC CTG CTC TGC CTG ACC CTG AAG AAC CCC ATC CGG AGG 570 
Pro Pro Arg Ala Leu Leu Cys Leu Thr Leu Lys Asn Pro Ile Arg Arg 
100 105 110 115 

GCC TGC ATC AGC ATT GTC GAA TGG AAA CCA TTT GAA ATA ATT ATT TTA 618 
Ala Cys Ile Ser Ile Val Glu Trp Lys Pro Phe Glu Ile Ile Ile Leu 
120 125 130 

CTG ACT ATT TTT GCC AAT TGT GTG GCC TTA GCG ATC TAT ATT CCC TTT 666 
Leu Thr Ile Phe Ala Asn Cys Val Ala Leu Ala Ile Tyr Ile Pro Phe 
135 140 145 

CCA GAA GAT GAT TCC AAC GCC ACC AAT TCC AAC CTG GAA CGA GTG GAA 714 
Pro Glu Asp Asp Ser Asn Ala Thr Asn Ser Asn Leu Glu Arg Val Glu 
150 155 160 

TAT CTC TTT CTC ATA ATT TTT ACG GTG GAA GCG TTT TTA AAA GTA ATC 762 
Tyr Leu Phe Leu Ile Ile Phe Thr Val Glu Ala Phe Leu Lys Val Ile 
165 170 175 

GCC TAT GGA CTC CTC TTT CAC CCC AAT GCC TAC CTC CGC AAC GGC TGG 810 
Ala Tyr Gly Leu Leu Phe His Pro Asn Ala Tyr Leu Arg Asn Gly Trp 
180 185 190 195 

AAC CTA CTA GAT TTT ATA ATT GTG GTT GTG GGG CTT TTT AGT GCA ATT 858 
Asn Leu Leu Asp Phe Ile Ile Val Val Val Gly Leu Phe Ser Ala Ile 
200 205 210 

TTA GAA CAA GCA ACC AAA GCA GAT GGG GCA AAC GCT CTC GGA GGG AAA 906 
Leu Glu Gln Ala Thr Lys Ala Asp Gly Ala Asn Ala Leu Gly Gly Lys 
215 220 225 

GGG GCC GGA TTT GAT GTG AAG GCG CTG AGG GCC TTC CGC GTG CTG CGC 954 
Gly Ala Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg 
230 235 240 

CCC CTG CGG CTG GTG TCC GGA GTC CCA AGT CTC CAG GTG GTC CTG AAT 1002 
Pro Leu Arg Leu Val Ser Gly Val Pro Ser Leu Gln Val Val Leu Asn 
245 250 255 

TCC ATC ATC AAG GCC ATG GTC CCC CTG CTG CAC ATC GCC CTG CTT GTG 1050 
Ser Ile Ile Lys Ala Met Val Pro Leu Leu His Ile Ala Leu Leu Val 
260 265 270 275 

CTG TTT GTC ATC ATC ATC TAC GCC ATC ATC GGC TTG GAG CTC TTC ATG 1098 
Leu Phe Val Ile Ile Ile Tyr Ala Ile Ile Gly Leu Glu Leu Phe Met 
280 285 290 

GGG AAG ATG CAC AAG ACC TGC TAC AAC CAG GAG GGC ATA GCA GAT GTT 1146 
Gly Lys Met His Lys Thr Cys Tyr Asn Gln Glu Gly Ile Ala Asp Val 
295 300 305 

CCA GCA GAA GAT GAC CCT TCC CCT TGT GCG CTG GAA ACG GGC CAC GGG 1194 
Pro Ala Glu Asp Asp Pro Ser Pro Cys Ala Leu Glu Thr Gly His Gly 
310 315 320 

CGG CAG TGC CAG AAC GGC ACG GTG TGC AAG CCC GGC TGG GAT GGT CCC 1242 
Arg Gln Cys Gln Asn Gly Thr Val Cys Lys Pro Gly Trp Asp Gly Pro 
325 330 335 

AAG CAC GGC ATC ACC AAC TTT GAC AAC TTT GCC TTC GCC ATG CTC ACG 1290 
Lys His Gly Ile Thr Asn Phe Asp Asn Phe Ala Phe Ala Met Leu Thr 
340 345 350 355 

GTG TTC CAG TGC ATC ACC ATG GAG GGC TGG ACG GAC GTG CTG TAC TGG 1338 
Val Phe Gln Cys Ile Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp 
360 365 370 

GTC AAT GAT GCC GTA GGA AGG GAC TGG CCC TGG ATC TAT TTT GTT ACA 1386
Val Asn Asp Ala Val Gly Arg Asp Trp Pro Trp Ile Tyr Phe Val Thr 
375 380 385 

CTA ATC ATC ATA GGG TCA TTT TTT GTA CTT AAC TTG GTT CTC GGT GTG 1434 
Leu Ile Ile Ile Gly Ser Phe Phe Val Leu Asn Leu Val Leu Gly Val 
390 395 400 
CTT AGC GGA GAG TTT TCC AAA GAG AGG GAG AAG GCC AAG GCC rrr nra 1482 
Leu Ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala Lys Ala Arg Gly 
405 410 415 

2£ ^ G CTG CGG GA< 3 AAG CAG CAG CTA GAA GAG GAT CTC AAA 1530 
Asp Phe Gln Lys Leu Arg Glu Lys Gln Gln Leu Glu Glu Asp Leu Lys 
425 430 435 

GGC TAC CTG GAT TGG ATC ACT CAG GCC GAA GAC ATC GAT CCT GAG AAT 1578 
Gly Tyr Leu Asp Trp Ile Thr Gln Ala Glu Asp Ile Asp Pro Glu Asn 
440 445 450 

GAG GAC GAA GGC ATG GAT GAG GAG AAG CCC CGA AAC ATG AGC ATG CCC 1626 
Glu Asp Glu Gly Met Asp Glu Glu Lys Pro Arg Asn Met Ser Met Pro 
455 460 465 

T^ stl ri° % C ^ G TCC GTC AAC ACC GAA AAC GTG GCT GGA GGT GAC 1674 
Thr Ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ala Gly Gly Asp 

T?f 2?* AAC TGC 5(36 GCC AGG CTG GCC CAC CGG ATC TCC AAG 1722 
Ile Glu Gly Glu Asn Cys Gly Ala Arg Leu Ala His Arg Ile Ser Lys 

12 JJJ Se J£? £ ™ ^ 2°° ^ GC CGG TGG ^ T CGG TTC TGC AGA AGG 1770 
Ser Lys Phe Ser Arg Tyr Trp Arg Arg Trp Asn Arg Phe Cys Arg Arg 
505 510 515 

f^f ^ G e ? GC G ? A GTC ^ TCT AAT GTC TTC TAC TGG CTG GTG ATT 1818 
Lys Cys Arg Ala Ala Val Lys Ser Asn Val Phe Tyr Trp Leu Val Ile 
520 525 530 

TTC CTG GTG TTC CTC AAC ACG CTC ACC ATT GCC TCT GAG CAC TAC AAC 1866 
Phe Leu Val Phe Leu Asn Thr Leu Thr Ile Ala Ser Glu His Tyr Asn 
535 540 545 

Gin ^ I GG T CTC AGA GAA GTC ^ GAC ACG GCA AAC AAG GCC CTG 1914 
Gln Pro Asn Trp Leu Thr Glu Val Gln Asp Thr Ala Asn Lys Ala Leu 
550 555 560 

flf, f i° t CTG ll C A S G GCA GAG ATG CTC CTG AAG ATG TAC AGC CTG GGC 1962 
Leu Ala Leu Phe Thr Ala Glu Met Leu Leu Lys Met Tyr Ser Leu Gly 
565 570 575 

CTG CAG GCC TAC TTC GTG TCC CTC TTC AAC CGC TTT GAC TGC TTC GTC 2010 
Leu Gln Ala Tyr Phe Val Ser Leu Phe Asn Arg Phe Asp Cys Phe Val 
580 585 590 595 

SI? Cvl r?£ tT° t CTG S AG ACC ATC CTG GTG GA <5 ACC AAG ATC ATG 2058 
Val Cys Gly Gly Ile Leu Glu Thr Ile Leu Val Glu Thr Lys Ile Met 
600 605 610 

TCC CCA CTG GGC ATC TCC GTG CTC AGA TGC GTC CGG CTG CTG AGG ATT 2106 
Ser Pro Leu Gly Ile Ser Val Leu Arg Cys Val Arg Leu Leu Arg Ile 
615 620 625 

TTC AAG ATC ACG AGG TAC TGG AAC TCC TTG AGC AAC CTG GTG GCA TCC 2154 
Phe Lys Ile Thr Arg Tyr Trp Asn Ser Leu Ser Asn Leu Val Ala Ser 
630 635 640 

TTG CTG AAC TCT GTG CGC TCC ATC GCC TCC CTG CTC CTT CTC CTC TTC 2202 
Leu Leu Asn Ser Val Arg Ser Ile Ala Ser Leu Leu Leu Leu Leu Phe 
645 650 655 

CTC TTC ATC ATC ATC TTC TCC CTC CTG GGG ATG CAG CTC TTT GGA GGA 2250 
Leu Phe Ile Ile Ile Phe Ser Leu Leu Gly Met Gln Leu Phe Gly Gly 
660 665 670 675 

AAG TTC AAC TTT GAT GAG ATG CAG ACC CGG AGG AGC ACA TTC GAT AAC 2298
Lys Phe Asn Phe Asp Glu Met Gln Thr Arg Arg Ser Thr Phe Asp Asn 
680 685 690 

TTC CCC CAG TCC CTC CTC ACT GTG TTT CAG ATC CTG ACC GGG GAG GAC 2346 
Phe Pro Gln Ser Leu Leu Thr Val Phe Gln Ile Leu Thr Gly Glu Asp 
695 700 705 

TGG AAT TCG GTG ATG TAT GAT GGG ATC ATG GCT TAT GGC GGC CCC TCT 2394 
Trp Asn Ser Val Met Tyr Asp Gly Ile Met Ala Tyr Gly Gly Pro Ser 
710 715 720 

TTT CCA GGG ATG TTA GTC TGT ATT TAC TTC ATC ATC CTC TTC ATC TGT 2442 
Phe Pro Gly Met Leu Val Cys Ile Tyr Phe Ile Ile Leu Phe Ile Cys 
725 730 735 

GGA AAC TAT ATC CTA CTG AAT GTG TTC TTG GCC ATT GCT GTG GAC AAC 2490 
Gly Asn Tyr Ile Leu Leu Asn Val Phe Leu Ala Ile Ala Val Asp Asn 
740 745 750 755 

CTG GCT GAT GCT GAG AGC CTC ACA TCT GCC CAA AAG GAG GAG GAA GAG 2538
Leu Ala Asp Ala Glu Ser Leu Thr Ser Ala Gln Lys Glu Glu Glu Glu 
760 765 770 

GAG AAG GAG AGA AAG AAG CTG GCC AGG ACT GCC AGC CCA GAG AAG AAA 2586 
Glu Lys Glu Arg Lys Lys Leu Ala Arg Thr Ala Ser Pro Glu Lys Lys 
775 780 785 

CAA GAG TTG GTG GAG AAG CCG GCA GTG GGG GAA TCC AAG GAG GAG AAG 2634
Gln Glu Leu Val Glu Lys Pro Ala Val Gly Glu Ser Lys Glu Glu Lys 
790 795 800 

ATT GAG CTG AAA TCC ATC ACG GCT GAC GGA GAG TCT CCA CCC GCC ACC 2682
Ile Glu Leu Lys Ser Ile Thr Ala Asp Gly Glu Ser Pro Pro Ala Thr 
805 810 815 

AAG ATC AAC ATG GAT GAC CTC CAG CCC AAT GAA AAT GAG GAT AAG AGC 2730
Lys Ile Asn Met Asp Asp Leu Gln Pro Asn Glu Asn Glu Asp Lys Ser 
820 825 830 835 

CCC TAC CCC AAC CCA GAA ACT ACA GGA GAA GAG GAT GAG GAG GAG CCA 2778 
Pro Tyr Pro Asn Pro Glu Thr Thr Gly Glu Glu Asp Glu Glu Glu Pro 
840 845 850 
GAG ATG CCT GTC GGC CCT CGC CCA CGA CCA CTC TCT GAG CTT CAC CTT 2826 
Glu Met Pro Val Gly Pro Arg Pro Arg Pro Leu Ser Glu Leu His Leu 
855 860 865 

AAG GAA AAG GCA GTG CCC ATG CCA GAA GCC AGC GCG TTT TTC ATC TTC 2874 
Lys Glu Lys Ala Val Pro Met Pro Glu Ala Ser Ala Phe Phe Ile Phe 
870 875 880 

AGC TCT AAC AAC AGG TTT CGC CTC CAG TGC CAC CGC ATT GTC AAT GAC 2922 
Ser Ser Asn Asn Arg Phe Arg Leu Gln Cys His Arg Ile Val Asn Asp 
885 890 895 

ACG ATC TTC ACC AAC CTG ATC CTC TTC TTC ATT CTG CTC AGC AGC ATT 2970 
Thr Ile Phe Thr Asn Leu Ile Leu Phe Phe Ile Leu Leu Ser Ser Ile 
900 905 910 915 

TCC CTG GCT GCT GAG GAC CCG GTC CAG CAC ACC TCC TTC AGG AAC CAT 3018 
Ser Leu Ala Ala Glu Asp Pro Val Gln His Thr Ser Phe Arg Asn His 
920 925 930 

ATT CTG TTT TAT TTT GAT ATT GTT TTT ACC ACC ATT TTC ACC ATT GAA 3066 
Ile Leu Phe Tyr Phe Asp Ile Val Phe Thr Thr Ile Phe Thr Ile Glu 
935 940 945 

ATT GCT CTG AAG ATG ACT GCT TAT GGG GCT TTC TTG CAC AAG GGT TCT 3114 
Ile Ala Leu Lys Met Thr Ala Tyr Gly Ala Phe Leu His Lys Gly Ser 
950 955 960 

TTC TGC CGG AAC TAC TTC AAC ATC CTG GAC CTG CTG GTG GTC AGC GTG 3162 
Phe Cys Arg Asn Tyr Phe Asn Ile Leu Asp Leu Leu Val Val Ser Val 
965 970 975 

TCC CTC ATC TCC TTT GGC ATC CAG TCC AGT GCA ATC AAT GTC GTG AAG 3210 
Ser Leu Ile Ser Phe Gly Ile Gln Ser Ser Ala Ile Asn Val Val Lys 
980 985 990 995 

ATC TTG CGA GTC CTG CGA GTA CTC AGG CCC CTG AGG GCC ATC AAC AGG 3258 
Ile Leu Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala Ile Asn Arg 
1000 1005 1010 

GCC AAG GGG CTA AAG CAT GTG GTT CAG TGT GTG TTT GTC GCC ATC CGG 3306
Ala Lys Gly Leu Lys His Val Val Gln Cys Val Phe Val Ala Ile Arg 
1015 1020 1025 

ACC ATC GGG AAC ATC GTG ATT GTC ACC ACC CTG CTG CAG TTC ATG TTT 3354
Thr Ile Gly Asn Ile Val Ile Val Thr Thr Leu Leu Gln Phe Met Phe 
1030 1035 1040 

GCC TGC ATC GGG GTC CAG CTC TTC AAG GGA AAG CTG TAC ACC TGT TCA 3402
Ala Cys Ile Gly Val Gln Leu Phe Lys Gly Lys Leu Tyr Thr Cys Ser 
1045 1050 1055 

GAC AGT TCC AAG CAG ACA GAG GCG GAA TGC AAG GGC AAC TAC ATC ACG 3450
Asp Ser Ser Lys Gln Thr Glu Ala Glu Cys Lys Gly Asn Tyr Ile Thr 
1060 1065 1070 1075 
TAC AAA GAC GGG GAG GTT GAC CAC CCC ATC ATC CAA CCC CGC AGC TGG 3498
Tyr Lys Asp Gly Glu Val Asp His Pro Ile Ile Gln Pro Arg Ser Trp 
1080 1085 1090 

GAG AAC AGC AAG TTT GAC TTT GAC AAT GTT CTG GCA GCC ATG ATG GCC 3546 
Glu Asn Ser Lys Phe Asp Phe Asp Asn Val Leu Ala Ala Met Met Ala 
1095 1100 1105 

CTC TTC ACC GTC TCC ACC TTC GAA GGG TGG CCA GAG CTG CTG TAC CGC 3594 

Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Glu Leu Leu Tyr Arg 
1110 1115 1120 

TCC ATC GAC TCC CAC ACG GAA GAC AAG GGC CCC ATC TAC AAC TAC CGT 3642 
Ser Ile Asp Ser His Thr Glu Asp Lys Gly Pro Ile Tyr Asn Tyr Arg 
1125 1130 1135 

GTG GAG ATC TCC ATC TTC TTC ATC ATC TAC ATC ATC ATC ATC GCC TTC 3690
Val Glu Ile Ser Ile Phe Phe Ile Ile Tyr Ile Ile Ile Ile Ala Phe 
1140 1145 1150 1155 

TTC ATG ATG AAC ATC TTC GTG GGC TTC GTC ATC GTC ACC TTT CAG GAG 3738 
Phe Met Met Asn Ile Phe Val Gly Phe Val Ile Val Thr Phe Gln Glu 
1160 1165 1170 

CAG GGG GAG CAG GAG TAC AAG AAC TGT GAG CTG GAC AAG AAC CAG CGA 3786 
Gln Gly Glu Gln Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gln Arg 
1175 1180 1185 

CAG TGC GTG GAA TAC GCC CTC AAG GCC CGG CCC CTG CGG AGG TAC ATC 3834 
Gln Cys Val Glu Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr Ile 
1190 1195 1200 

CCC AAG AAC CAG CAC CAG TAC AAA GTG TGG TAC GTG GTC AAC TCC ACC 3882
Pro Lys Asn Gln His Gln Tyr Lys Val Trp Tyr Val Val Asn Ser Thr 
1205 1210 1215 

TAC TTC GAG TAC CTG ATG TTC GTC CTC ATC CTG CTC AAC ACC ATC TGC 3930
Tyr Phe Glu Tyr Leu Met Phe Val Leu Ile Leu Leu Asn Thr Ile Cys 
1220 1225 1230 1235 

CTG GCC ATG CAG CAC TAC GGC CAG AGC TGC CTG TTC AAA ATC GCC ATG 3978 
Leu Ala Met Gln His Tyr Gly Gln Ser Cys Leu Phe Lys Ile Ala Met 
1240 1245 1250 

AAC ATC CTC AAC ATG CTC TTC ACT GGC CTC TTT ACC GTG GAG ATG ATC 4026
Asn Ile Leu Asn Met Leu Phe Thr Gly Leu Phe Thr Val Glu Met Ile 
1255 1260 1265 

CTG AAG CTC ATT GCC TTC AAA CCC AAG CAC TAT TTC TGT GAT GCA TGG 4074
Leu Lys Leu Ile Ala Phe Lys Pro Lys His Tyr Phe Cys Asp Ala Trp 
1270 1275 1280 

AAT ACA TTT GAC GCC TTG ATT GTT GTG GGT AGC ATT GTT GAT ATA GCA 4122 
Asn Thr Phe Asp Ala Leu Ile Val Val Gly Ser Ile Val Asp Ile Ala 
1285 1290 1295 
ATC ACC GAG GTA AAC CCA GCT GAA CAT ACC CAA TGC TCT CCC TCT ATG 4170 
Ile Thr Glu Val Asn Pro Ala Glu His Thr Gln Cys Ser Pro Ser Met 
1300 1305 1310 1315 

AAC GCA GAG GAA AAC TCC CGC ATC TCC ATC ACC TTC TTC CGC CTG TTC 4218 
Asn Ala Glu Glu Asn Ser Arg Ile Ser Ile Thr Phe Phe Arg Leu Phe 
1320 1325 1330 

CGG GTC ATG CGT CTG GTG AAG CTG CTG AGC CGT GGG GAG GGC ATC CGG 4266 
Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly Glu Gly Ile Arg 
1335 1340 1345 

ACG CTG CTG TGG ACC TTC ATC AAG TCC TTC CAG GCC CTG CCC TAT GTG 4314 
Thr Leu Leu Trp Thr Phe Ile Lys Ser Phe Gln Ala Leu Pro Tyr Val 
1350 1355 1360 

GCC CTC CTG ATC GTG ATG CTG TTC TTC ATC TAC GCG GTG ATC GGG ATG 4362 
Ala Leu Leu Ile Val Met Leu Phe Phe Ile Tyr Ala Val Ile Gly Met 
1365 1370 1375 

CAG GTG TTT GGG AAA ATT GCC CTG AAT GAT ACC ACA GAG ATC AAC CGG 4410 
Gln Val Phe Gly Lys Ile Ala Leu Asn Asp Thr Thr Glu Ile Asn Arg 
1380 1385 1390 1395 

AAC AAC AAC TTT CAG ACC TTC CCC CAG GCC GTG CTG CTC CTC TTC AGG 4458 
Asn Asn Asn Phe Gln Thr Phe Pro Gln Ala Val Leu Leu Leu Phe Arg 
1400 1405 1410 

TGT GCC ACC GGG GAG GCC TGG CAG GAC ATC ATG CTG GCC TGC ATG CCA 4506 
Cys Ala Thr Gly Glu Ala Trp Gln Asp Ile Met Leu Ala Cys Met Pro 
1415 1420 1425 

GGC AAG AAG TGT GCC CCA GAG TCC GAG CCC AGC AAC AGC ACG GAG GGT 4554 

Gly Lys Lys Cys Ala Pro Glu Ser Glu Pro Ser Asn Ser Thr Glu Gly 
1430 1435 1440 

GAA ACA CCC TGT GGT AGC AGC TTT GCT GTC TTC TAC TTC ATC AGC TTC 4602
Glu Thr Pro Cys Gly Ser Ser Phe Ala Val Phe Tyr Phe Ile Ser Phe 
1445 1450 1455 

TAC ATG CTC TGT GCC TTC CTG ATC ATC AAC CTC TTT GTA GCT GTC ATC 4650
Tyr Met Leu Cys Ala Phe Leu Ile Ile Asn Leu Phe Val Ala Val Ile 
1460 1465 1470 1475 

ATG GAC AAC TTT GAC TAC CTG ACA AGG GAC TGG TCC ATC CTT GGT CCC 4698 
Met Asp Asn Phe Asp Tyr Leu Thr Arg Asp Trp Ser Ile Leu Gly Pro 
1480 1485 1490 

CAC CAC CTG GAT GAG TTT AAA AGA ATC TGG GCA GAG TAT GAC CCT GAA 4746 
His His Leu Asp Glu Phe Lys Arg Ile Trp Ala Glu Tyr Asp Pro Glu 
1495 1500 1505 

GCC AAG GGT CGT ATC AAA CAC CTG GAT GTG GTG ACC CTC CTC CGG CGG 4794
Ala Lys Gly Arg Ile Lys His Leu Asp Val Val Thr Leu Leu Arg Arg 
1510 1515 1520 
ATT CAG CCG CCA CTA GGT TTT GGG AAG CTG TGC CCT CAC CGC GTG GCT 4842 
Ile Gln Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala 
1525 1530 1535 

TGC AAA CGC CTG GTC TCC ATG AAC ATG CCT CTG AAC AGC GAC GGG ACA 4890
Cys Lys Arg Leu Val Ser Met Asn Met Pro Leu Asn Ser Asp Gly Thr 
1540 1545 1550 1555 

GTC ATG TTC AAT GCC ACC CTG TTT GCC CTG GTC AGG ACG GCC CTG AGG 4938 

Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Arg 
1560 1565 1570 

ATC AAA ACA GAA GGG AAC CTA GAA CAA GCC AAT GAG GAG CTG CGG GCG 4986
Ile Lys Thr Glu Gly Asn Leu Glu Gln Ala Asn Glu Glu Leu Arg Ala 
1575 1580 1585 

ATC ATC AAG AAG ATC TGG AAG CGG ACC AGC ATG AAG CTG CTG GAC CAG 5034 
Ile Ile Lys Lys Ile Trp Lys Arg Thr Ser Met Lys Leu Leu Asp Gln 
1590 1595 1600 

GTG GTG CCC CCT GCA GGT GAT GAT GAG GTC ACC GTT GGC AAG TTC TAC 5082 
Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr 
1605 1610 1615 

GCC ACG TTC CTG ATC CAG GAG TAC TTC CGG AAG TTC AAG AAG CGC AAA 5130 
Ala Thr Phe Leu Ile Gln Glu Tyr Phe Arg Lys Phe Lys Lys Arg Lys 
1620 1625 1630 1635 

GAG CAG GGC CTT GTG GGC AAG CCC TCC CAG AGG AAC GCG CTG TCT CTG 5178
Glu Gln Gly Leu Val Gly Lys Pro Ser Gln Arg Asn Ala Leu Ser Leu 
1640 1645 1650 

CAG GCT GGC TTG CGC ACA CTG CAT GAC ATC GGG CCT GAG ATC CGA CGG 5226
Gln Ala Gly Leu Arg Thr Leu His Asp Ile Gly Pro Glu Ile Arg Arg 
1655 1660 1665 

GCC ATC TCT GGA GAT CTC ACC GCT GAG GAG GAG CTG GAC AAG GCC ATG 5274 
Ala Ile Ser Gly Asp Leu Thr Ala Glu Glu Glu Leu Asp Lys Ala Met 
1670 1675 1680 

AAG GAG GCT GTG TCC GCT GCT TCT GAA GAT GAC ATC TTC AGG AGG GCC 5322 
Lys Glu Ala Val Ser Ala Ala Ser Glu Asp Asp Ile Phe Arg Arg Ala 
1685 1690 1695 

GGT GGC CTG TTC GGC AAC CAC GTC AGC TAC TAC CAA AGC GAC GGC CGG 5370
Gly Gly Leu Phe Gly Asn His Val Ser Tyr Tyr Gln Ser Asp Gly Arg 
1700 1705 1710 1715 

AGC GCC TTC CCC CAG ACC TTC ACC ACT CAG CGC CCG CTG CAC ATC AAC 5418 
Ser Ala Phe Pro Gln Thr Phe Thr Thr Gln Arg Pro Leu His Ile Asn 
1720 1725 1730 

AAG GCG GGC AGC AGC CAG GGC GAC ACT GAG TCG CCA TCC CAC GAG AAG 5466 
Lys Ala Gly Ser Ser Gln Gly Asp Thr Glu Ser Pro Ser His Glu Lys 
1735 1740 1745 
CTG GTG GAC TCC ACC TTC ACC CCG AGC AGC TAC TCG TCC ACC GGC TCC 5514 
Leu Val Asp Ser Thr Phe Thr Pro Ser Ser Tyr Ser Ser Thr Gly Ser 
1750 1755 1760 

AAC GCC AAC ATC AAC AAC GCC AAC AAC ACC GCC CTG GGT CGC CTC CCT 5562 
Asn Ala Asn Ile Asn Asn Ala Asn Asn Thr Ala Leu Gly Arg Leu Pro 
1765 1770 1775 

CGC CCC GCC GGC TAC CCC AGC ACG GTC AGC ACT GTG GAG GGC CAC GGG 5610 
Arg Pro Ala Gly Tyr Pro Ser Thr Val Ser Thr Val Glu Gly His Gly 
17 80 1785 1790 1795 

CCC CCC TTG TCC CCT GCC ATC CGG GTG CAG GAG GTG GCG TGG AAG CTC 5658 
Pro Pro Leu Ser Pro Ala He Arg Val Gin Glu Val Ala Trp Lys Leu 
1800 1805 1810 

AGC TCC AAC AGG TGC CAC TCC CGG GAG AGC CAG GCA GCC ATG GCG GGT 5706 
Ser Ser Asn Arg Cys His Ser Arg Glu Ser Gin Ala Ala Met Ala Gly 
1815 1820 1825 

CAG GAG GAG ACG TCT CAG GAT GAG ACC TAT GAA GTG AAG ATG AAC CAT 5754 
Gin Glu Glu Thr Ser Gin Asp Glu Thr Tyr Glu Val Lys Met Asn His 
1830 1835 1840 

GAC ACG GAG GCC TGC AGT GAG CCC AGC CTG CTC TCC ACA GAG ATG CTC 5802 
Asp Thr Glu Ala Cys Ser Glu Pro Ser Leu Leu Ser Thr Glu Met Leu 
1845 1850 1855 

TCC TAC CAG GAT GAC GAA AAT CGG CAA CTG ACG CTC CCA GAG GAG GAC 585 0 

Ser Tyr Gin Asp Asp Glu Asn Arg Gin Leu Thr Leu Pro Glu Glu Asp 
1860 1865 1870 1875 

AAG AGG GAC ATC CGG CAA TCT CCG AAG AGG GGT TTC CTC CGC TCT GCC 5898
Lys Arg Asp Ile Arg Gln Ser Pro Lys Arg Gly Phe Leu Arg Ser Ala
1880 1885 1890

TCA CTA GGT CGA AGG GCC TCC TTC CAC CTG GAA TGT CTG AAG CGA CAG 5946
Ser Leu Gly Arg Arg Ala Ser Phe His Leu Glu Cys Leu Lys Arg Gln
1895 1900 1905

AAG GAC CGA GGG GGA GAC ATC TCT CAG AAG ACA GTC CTG CCC TTG CAT 5994
Lys Asp Arg Gly Gly Asp Ile Ser Gln Lys Thr Val Leu Pro Leu His
1910 1915 1920

CTG GTT CAT CAT CAG GCA TTG GCA GTG GCA GGC CTG AGC CCC CTC CTC 6042
Leu Val His His Gln Ala Leu Ala Val Ala Gly Leu Ser Pro Leu Leu
1925 1930 1935

CAG AGA AGC CAT TCC CCT GCC TCA TTC CCT AGG CCT TTT GCC ACC CCA 6090
Gln Arg Ser His Ser Pro Ala Ser Phe Pro Arg Pro Phe Ala Thr Pro
1940 1945 1950 1955

CCA GCC ACA CCT GGC AGC CGA GGC TGG CCC CCA CAG CCC GTC CCC ACC 6138
Pro Ala Thr Pro Gly Ser Arg Gly Trp Pro Pro Gln Pro Val Pro Thr
1960 1965 1970






CTG CGG CTT GAG GGG GTC GAG TCC AGT GAG AAA CTC AAC AGC AGC TTC 6186
Leu Arg Leu Glu Gly Val Glu Ser Ser Glu Lys Leu Asn Ser Ser Phe
1975 1980 1985

CCA TCC ATC CAC TGC GGC TCC TGG GCT GAG ACC ACC CCC GGT GGC GGG 6234
Pro Ser Ile His Cys Gly Ser Trp Ala Glu Thr Thr Pro Gly Gly Gly
1990 1995 2000

GGC AGC AGC GCC GCC CGG AGA GTC CGG CCC GTC TCC CTC ATG GTG CCC 6282
Gly Ser Ser Ala Ala Arg Arg Val Arg Pro Val Ser Leu Met Val Pro
2005 2010 2015

AGC CAG GCT GGG GCC CCA GGG AGG CAG TTC CAC GGC AGT GCC AGC AGC 6330
Ser Gln Ala Gly Ala Pro Gly Arg Gln Phe His Gly Ser Ala Ser Ser
2020 2025 2030 2035

CTG GTG GAA GCG GTC TTG ATT TCA GAA GGA CTG GGG CAG TTT GCT CAA 6378
Leu Val Glu Ala Val Leu Ile Ser Glu Gly Leu Gly Gln Phe Ala Gln
2040 2045 2050

GAT CCC AAG TTC ATC GAG GTC ACC ACC CAG GAG CTG GCC GAC GCC TGC 6426
Asp Pro Lys Phe Ile Glu Val Thr Thr Gln Glu Leu Ala Asp Ala Cys
2055 2060 2065

GAC ATG ACC ATA GAG GAG ATG GAG AGC GCG GCC GAC AAC ATC CTC AGC 6474
Asp Met Thr Ile Glu Glu Met Glu Ser Ala Ala Asp Asn Ile Leu Ser
2070 2075 2080

GGG GGC GCC CCA CAG AGC CCC AAT GGC GCC CTC TTA CCC TTT GTG AAC 6522
Gly Gly Ala Pro Gln Ser Pro Asn Gly Ala Leu Leu Pro Phe Val Asn
2085 2090 2095

TGC AGG GAC GCG GGG CAG GAC CGA GCC GGG GGC GAA GAG GAC GCG GGC 6570
Cys Arg Asp Ala Gly Gln Asp Arg Ala Gly Gly Glu Glu Asp Ala Gly
2100 2105 2110 2115

TGT GTG CGC GCG CGG GGT CGA CCG AGT GAG GAG GAG CTC CAG GAC AGC 6618
Cys Val Arg Ala Arg Gly Arg Pro Ser Glu Glu Glu Leu Gln Asp Ser
2120 2125 2130

AGG GTC TAC GTC AGC AGC CTG TAGTGGGCGC TGCCAGATGC GGGCTTTTTT 6669
Arg Val Tyr Val Ser Ser Leu
2135

TTATTTGTTT CAATGTTCCT AATGGGTTCG TTTCAGAAGT GCCTCACTGT TCTCGT 6725 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2970 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 502..2316

(D) OTHER INFORMATION: /standard_name= "Beta-2C"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

CAGCAGCGTG CTAAGAAGCA GTCACATAAA CAGCAGCAGG AGTAGGCCTC CTGCTTTTCA 60

AAAGCAGAGT ACTGCAGGGT CGCGAAATGC AAGACACTCA GATGTTTGAA AATCTCCCGA 120

GTTGAGAATG GCTACTGTAA AAGCGTCACC AAGAAACTCT GACGATCTGG ACAGTCCTAA 180

CTCTGTGTTA GCAATACTTA CTTCCGGAAA ATTAATGCTA CTTCTTGTAG ATTTTTGCAA 240

ATAGGAAACC CCCTTGAAGA AGATCTCAAA TTACGCCCCC CACCCCCAAA AAAAGACAAA 300

CAGGGGAGAA CAAAGTTTTG GCATGCCTGC AGGAACGGTG GCTTTTTTAG AAACTACCTA 360

GGAGGCAGAA GCTAAGTGAT TTGCTCATGC CTCTTACCTG GGAGTAGAAG GTGGGAAGAA 420

ATGGACCGAG GCTGTGACGA GAAGACAAGG CACAGTGCAG CTTGGTGAAG CCACACGCTG 480

ACTGCGTTCT GCCCCCTCTT C ATG CAG TGC TGC GGG CTG GTG CAT CGC CGG 531
Met Gln Cys Cys Gly Leu Val His Arg Arg
1 5 10

CGA GTA CGG GTG TCC TAT GGT TCG GCA GAC TCC TAC ACT AGC CGT CCA 579 
Arg Val Arg Val Ser Tyr Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro 
15 20 25 

TCC GAT TCC GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG CGC AGA 627 
Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg Arg 
30 35 40 

GAA GCG GAG CGG CAG GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA AAG 675
Glu Ala Glu Arg Gln Ala Gln Ala Gln Leu Glu Lys Ala Lys Thr Lys
45 50 55

CCC GTT GCA TTT GCG GTT CGG ACA AAT GTC AGC TAC AGT GCG GCC CAT 723 
Pro Val Ala Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala His 
60 65 70 

GAA GAT GAT GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA AAA 771
Glu Asp Asp Val Pro Val Pro Gly Met Ala Ile Ser Phe Glu Ala Lys
75 80 85 90

GAT TTT CTG CAT GTT AAG GAA AAA TTT AAC AAT GAC TGG TGG ATA GGG 819
Asp Phe Leu His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp Ile Gly
95 100 105

CGA TTG GTA AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA GTC 867
Arg Leu Val Lys Glu Gly Cys Glu Ile Gly Phe Ile Pro Ser Pro Val
110 115 120

AAA CTA GAA AAC ATG AGG CTG CAG CAT GAA CAG AGA GCC AAG CAA GGG 915
Lys Leu Glu Asn Met Arg Leu Gln His Glu Gln Arg Ala Lys Gln Gly
125 130 135

AAA TTC TAC TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG GGT 963
Lys Phe Tyr Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu Gly
140 145 150

GAC ATA GTA CCT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT GCT ATA 1011
Asp Ile Val Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala Ile
155 160 165 170

GAC ATA GAT GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA GCA 1059
Asp Ile Asp Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp Ile Pro Ala
175 180 185

AAC CAC CGC TCC CCT AAA CCC AGT GCA AAC AGT GTA ACG TCA CCC CAC 1107
Asn His Arg Ser Pro Lys Pro Ser Ala Asn Ser Val Thr Ser Pro His
190 195 200

TCC AAA GAG AAA AGA ATG CCC TTC TTT AAG AAG ACA GAG CAC ACT CCT 1155 
Ser Lys Glu Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr Pro 
205 210 215 

CCG TAT GAT GTG GTA CCT TCC ATG CGA CCA GTG GTC CTA GTG GGC CCT 1203 
Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val Val Leu Val Gly Pro 
220 225 230 

TCT CTG AAG GGC TAC GAG GTC ACA GAT ATG ATG CAA AAA GCG CTG TTT 1251
Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gln Lys Ala Leu Phe
235 240 245 250

GAT TTT TTA AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG GTC 1299
Asp Phe Leu Lys His Arg Phe Glu Gly Arg Ile Ser Ile Thr Arg Val
255 260 265

ACC GCT GAC ATC TCG CTT GCC AAA CGC TCG GTA TTA AAC AAT CCC AGT 1347
Thr Ala Asp Ile Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser
270 275 280

AAG CAC GCA ATA ATA GAA AGA TCC AAC ACA AGG TCA AGC TTA GCG GAA 1395
Lys His Ala Ile Ile Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu
285 290 295

GTT CAG AGT GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG CAG 1443
Val Gln Ser Glu Ile Glu Arg Ile Phe Glu Leu Ala Arg Thr Leu Gln
300 305 310

TTG GTG GTC CTT GAC GCG GAT ACA ATT AAT CAT CCA GCT CAA CTC AGT 1491
Leu Val Val Leu Asp Ala Asp Thr Ile Asn His Pro Ala Gln Leu Ser
315 320 325 330

AAA ACC TCC TTG GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT CCT 1539
Lys Thr Ser Leu Ala Pro Ile Ile Val Tyr Val Lys Ile Ser Ser Pro
335 340 345

AAG GTT TTA CAA AGG TTA ATA AAA TCT CGA GGG AAA TCT CAA GCT AAA 1587
Lys Val Leu Gln Arg Leu Ile Lys Ser Arg Gly Lys Ser Gln Ala Lys
350 355 360

CAC CTC AAC GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT CCT 1635
His Leu Asn Val Gln Met Val Ala Ala Asp Lys Leu Ala Gln Cys Pro
365 370 375

CCA GAG CTG TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT GCC 1683
Pro Glu Leu Phe Asp Val Ile Leu Asp Glu Asn Gln Leu Glu Asp Ala
380 385 390

TGT GAG CAC CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC CAT 1731 
Cys Glu His Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His 
395 400 405 410 

CCT CCC AGC AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA GCC 1779 
Pro Pro Ser Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu Ala 
415 420 425 

ACT TCA AGT CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG GGT 1827
Thr Ser Ser Leu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gln Gly
430 435 440

TCT CAA GGT GAT CAG AGG ACT GAT CGC TCC GCT CCT ATC CGT TCT GCT 1875
Ser Gln Gly Asp Gln Arg Thr Asp Arg Ser Ala Pro Ile Arg Ser Ala
445 450 455

TCC CAA GCT GAA GAA GAA CCT AGT GTG GAA CCA GTC AAG AAA TCC CAG 1923
Ser Gln Ala Glu Glu Glu Pro Ser Val Glu Pro Val Lys Lys Ser Gln
460 465 470

CAC CGC TCT TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT GGG ACA 1971
His Arg Ser Ser Ser Ser Ala Pro His His Asn His Arg Ser Gly Thr
475 480 485 490

AGT CGC GGC CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG GAG 2019
Ser Arg Gly Leu Ser Arg Gln Glu Thr Phe Asp Ser Glu Thr Gln Glu
495 500 505

AGT CGA GAC TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT GAC 2067 
Ser Arg Asp Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His Asp 
510 515 520 

CAC GTG GAC CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC AGA GAC GAG 2115
His Val Asp His Tyr Ala Ser His Arg Asp His Asn His Arg Asp Glu
525 530 535

ACC CAC GGG AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT TCC 2163
Thr His Gly Ser Ser Asp His Arg His Arg Glu Ser Arg His Arg Ser
540 545 550

CGG GAC GTG GAT CGA GAG CAG GAC CAC AAC GAG TGC AAC AAG CAG CGC 2211
Arg Asp Val Asp Arg Glu Gln Asp His Asn Glu Cys Asn Lys Gln Arg
555 560 565 570

AGC CGT CAT AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA GTG 2259
Ser Arg His Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu Val
575 580 585

ATA TCA AAA AAA CGG AAT GAG GCT GGG GAG TGG AAC AGG GAT GTT TAC 2307
Ile Ser Lys Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val Tyr
590 595 600

ATC CCC CAA TGAGTTTTGC CCTTTTGTGT TTTTTTTTTT TTTTTTTTGA 2356
Ile Pro Gln
605



AGTCTTGTAT AACTAACAGC ATCCCCAAAA CAAAAAGTCT TTGGGGTCTA CACTGCAATC 2416

ATATGTGATC TGTCTTGTAA TATTTTGTAT TATTGCTGTT GCTTGAATAG CAATAGCATG 2476

GATAGAGTAT TGAGATACTT TTTCTTTTGT AAGTGCTACA TAAATTGGCC TGGTATGGCT 2536

GCAGTCCTCC GGTTGCATAC TGGACTCTTC AAAAACTGTT TTGGGTAGCT GCCACTTGAA 2596

CAAAATCTGT TGCCACCCAG GTGATGTTAG TGTTTTAAGA AATGTAGTTG ATGTATCCAA 2656

CAAGCCAGAA TCAGCACAGA TAAAAAGTGG AATTTCTTGT TTCTCCAGAT TTTTAATACG 2716

TTAATACGCA GGCATCTGAT TTGCATATTC ATTCATGGAC CACTGTTTCT TGCTTGTACC 2776

TCTGGCTGAC TAAATTTGGG GACAGATTCA GTCTTGCCTT ACACAAAGGG GATCATAAAG 2836

TTAGAATCTA TTTTCTATGT ACTAGTACTG TGTACTGTAT AGACAGTTTG TAAATGTTAT 2896

TTCTGCAAAC AAACACCTCC TTATTATATA TAATATATAT ATATATATCA GTTTGATCAC 2956

ACTATTTTAG AGTC 2970
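The relationship between the CDS coordinates of SEQ ID NO:37 and the numbered amino acid residues can be checked mechanically. The following is a minimal sketch, not part of the disclosure, that assumes the standard genetic code and 1-based inclusive coordinates; the helper names `codon_count` and `translate_cds` are hypothetical and introduced only for illustration. For the CDS location 502..2316 it gives 605 codons, which agrees with the final residue number (605) in the listing, and it reproduces the first ten residues from the row ending at nucleotide 531.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# no alternative translation table applies to these sequences).
BASES = "TCAG"
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), ONE_LETTER)
}

# Three-letter residue names as used in the listing; "Ter" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def codon_count(start, end):
    """Number of codons spanned by 1-based inclusive coordinates start..end."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("coordinates do not span a whole number of codons")
    return length // 3


def translate_cds(dna, start, end):
    """Translate dna[start..end] (1-based, inclusive) into three-letter residues.

    Whitespace in `dna` is ignored, so rows can be pasted as printed.
    """
    bases = "".join(dna.split()).upper()
    n_codons = codon_count(start, end)
    cds = bases[start - 1:end]
    if len(cds) != 3 * n_codons:
        raise ValueError("coordinates extend beyond the supplied sequence")
    return [THREE_LETTER[CODON_TO_AA[cds[i:i + 3]]] for i in range(0, len(cds), 3)]


if __name__ == "__main__":
    # Nucleotides 481-531 of SEQ ID NO:37 as printed; within this fragment
    # the initiating ATG starts at offset 22.
    fragment = "ACTGCGTTCT GCCCCCTCTT C ATG CAG TGC TGC GGG CTG GTG CAT CGC CGG"
    print(codon_count(502, 2316))                       # 605
    print(" ".join(translate_cds(fragment, 22, 51)))    # Met Gln Cys Cys Gly Leu Val His Arg Arg
```

The same check can be applied row by row to confirm that a printed codon line and its residue line agree.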



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 223..2061

(D) OTHER INFORMATION: /standard_name= "Beta-2E" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
AGTGTGTGTT TTCAGCCCCT CCTGGAATGG GAAAATAAGA ATCTCCCTGG ATGGGAGTCC 60

TCTGGGGCAG GGAGTGAAAG CCCCGGAGGC AGAAAGGGAC GGAGAACAGG GGCTTGCCCA 120

GAGCATGGAT AGGAAAGGAG CTGGGGTTCT CCGGGGCTCA GCGCGCACTG AGAACCTGTG 180

CCCGGGGCTG CAGCTGCGGA CGATAAAGGC GCTGTCTGGC TC ATG AAG GCC ACC 234
Met Lys Ala Thr
1

TGG ATC AGG CTT CTG AAA AGA GCC AAG GGA GGA AGG CTG AAG AAT TCT 282
Trp Ile Arg Leu Leu Lys Arg Ala Lys Gly Gly Arg Leu Lys Asn Ser
5 10 15 20

GAT ATC TGT GGT TCG GCA GAC TCC TAC ACT AGC CGT CCA TCC GAT TCC 330
Asp Ile Cys Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser
25 30 35

GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG CGC AGA GAA GCG GAG 378
Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg Arg Glu Ala Glu
40 45 50

CGG CAG GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA AAG CCC GTT GCA 426
Arg Gln Ala Gln Ala Gln Leu Glu Lys Ala Lys Thr Lys Pro Val Ala
55 60 65

TTT GCG GTT CGG ACA AAT GTC AGC TAC AGT GCG GCC CAT GAA GAT GAT 474
Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala His Glu Asp Asp
70 75 80

GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA AAA GAT TTT CTG 522
Val Pro Val Pro Gly Met Ala Ile Ser Phe Glu Ala Lys Asp Phe Leu
85 90 95 100

CAT GTT AAG GAA AAA TTT AAC AAT GAC TGG TGG ATA GGG CGA TTG GTA 570
His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp Ile Gly Arg Leu Val
105 110 115

AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA GTC AAA CTA GAA 618
Lys Glu Gly Cys Glu Ile Gly Phe Ile Pro Ser Pro Val Lys Leu Glu
120 125 130

AAC ATG AGG CTG CAG CAT GAA CAG AGA GCC AAG CAA GGG AAA TTC TAC 666
Asn Met Arg Leu Gln His Glu Gln Arg Ala Lys Gln Gly Lys Phe Tyr
135 140 145

TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG GGT GAC ATA GTA 714
Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu Gly Asp Ile Val
150 155 160

CCT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT GCT ATA GAC ATA GAT 762
Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala Ile Asp Ile Asp
165 170 175 180

GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA GCA AAC CAC CGC 810
Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp Ile Pro Ala Asn His Arg
185 190 195

TCC CCT AAA CCC AGT GCA AAC AGT GTA ACG TCA CCC CAC TCC AAA GAG 858
Ser Pro Lys Pro Ser Ala Asn Ser Val Thr Ser Pro His Ser Lys Glu
200 205 210

AAA AGA ATG CCC TTC TTT AAG AAG ACA GAG CAC ACT CCT CCG TAT GAT 906
Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr Pro Pro Tyr Asp
215 220 225

GTG GTA CCT TCC ATG CGA CCA GTG GTC CTA GTG GGC CCT TCT CTG AAG 954
Val Val Pro Ser Met Arg Pro Val Val Leu Val Gly Pro Ser Leu Lys
230 235 240

GGC TAC GAG GTC ACA GAT ATG ATG CAA AAA GCG CTG TTT GAT TTT TTA 1002
Gly Tyr Glu Val Thr Asp Met Met Gln Lys Ala Leu Phe Asp Phe Leu
245 250 255 260

AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG GTC ACC GCT GAC 1050
Lys His Arg Phe Glu Gly Arg Ile Ser Ile Thr Arg Val Thr Ala Asp
265 270 275

ATC TCG CTT GCC AAA CGC TCG GTA TTA AAC AAT CCC AGT AAG CAC GCA 1098
Ile Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His Ala
280 285 290

ATA ATA GAA AGA TCC AAC ACA AGG TCA AGC TTA GCG GAA GTT CAG AGT 1146
Ile Ile Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gln Ser
295 300 305

GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG CAG TTG GTG GTC 1194
Glu Ile Glu Arg Ile Phe Glu Leu Ala Arg Thr Leu Gln Leu Val Val
310 315 320

CTT GAC GCG GAT ACA ATT AAT CAT CCA GCT CAA CTC AGT AAA ACC TCC 1242
Leu Asp Ala Asp Thr Ile Asn His Pro Ala Gln Leu Ser Lys Thr Ser
325 330 335 340

TTG GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT CCT AAG GTT TTA 1290
Leu Ala Pro Ile Ile Val Tyr Val Lys Ile Ser Ser Pro Lys Val Leu
345 350 355

CAA AGG TTA ATA AAA TCT CGA GGG AAA TCT CAA GCT AAA CAC CTC AAC 1338
Gln Arg Leu Ile Lys Ser Arg Gly Lys Ser Gln Ala Lys His Leu Asn
360 365 370

GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT CCT CCA GAG CTG 1386
Val Gln Met Val Ala Ala Asp Lys Leu Ala Gln Cys Pro Pro Glu Leu
375 380 385

TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT GCC TGT GAG CAC 1434
Phe Asp Val Ile Leu Asp Glu Asn Gln Leu Glu Asp Ala Cys Glu His
390 395 400

CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC CAT CCT CCC AGC 1482
Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser
405 410 415 420

AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA GCC ACT TCA AGT 1530
Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu Ala Thr Ser Ser
425 430 435

CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG GGT TCT CAA GGT 1578
Leu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gln Gly Ser Gln Gly
440 445 450

GAT CAG AGG ACT GAT CGC TCC GCT CCT ATC CGT TCT GCT TCC CAA GCT 1626
Asp Gln Arg Thr Asp Arg Ser Ala Pro Ile Arg Ser Ala Ser Gln Ala
455 460 465

GAA GAA GAA CCT AGT GTG GAA CCA GTC AAG AAA TCC CAG CAC CGC TCT 1674
Glu Glu Glu Pro Ser Val Glu Pro Val Lys Lys Ser Gln His Arg Ser
470 475 480

TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT GGG ACA AGT CGC GGC 1722
Ser Ser Ser Ala Pro His His Asn His Arg Ser Gly Thr Ser Arg Gly
485 490 495 500

CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG GAG AGT CGA GAC 1770
Leu Ser Arg Gln Glu Thr Phe Asp Ser Glu Thr Gln Glu Ser Arg Asp
505 510 515

TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT GAC CAC GTG GAC 1818
Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His Asp His Val Asp
520 525 530

CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC AGA GAC GAG ACC CAC GGG 1866
His Tyr Ala Ser His Arg Asp His Asn His Arg Asp Glu Thr His Gly
535 540 545

AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT TCC CGG GAC GTG 1914
Ser Ser Asp His Arg His Arg Glu Ser Arg His Arg Ser Arg Asp Val
550 555 560

GAT CGA GAG CAG GAC CAC AAC GAG TGC AAC AAG CAG CGC AGC CGT CAT 1962
Asp Arg Glu Gln Asp His Asn Glu Cys Asn Lys Gln Arg Ser Arg His
565 570 575 580

AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA GTG ATA TCA AAA 2010
Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu Val Ile Ser Lys
585 590 595

AAA CGG AAT GAG GCT GGG GAG TGG AAC AGG GAT GTT TAC ATC CCC CAA 2058
Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val Tyr Ile Pro Gln
600 605 610

TGAGTTTTGC CCTTTTGTGT TTTTTTTTTT TTTTTTTTGA AGTCTTGTAT AACTAACAGC 2118 

ATCCCCAAAA CAAAAAGTCT TTGGGGTCTA CACTGCAATC ATATGTGATC TGTCTTGTAA 2178

TATTTTGTAT TATTGCTGTT GCTTGAATAG CAATAGCATG GATAGAGTAT TGAGATACTT 2238

TTTCTTTTGT AAGTGCTACA TAAATTGGCC TGGTATGGCT GCAGTCCTCC GGTTGCATAC 2298

TGGACTCTTC AAAAACTGTT TTGGGTAGCT GCCACTTGAA CAAAATCTGT TGCCACCCAG 2358

GTGATGTTAG TGTTTTAAGA AATGTAGTTG ATGTATCCAA CAAGCCAGAA TCAGCACAGA 2418

TAAAAAGTGG AATTTCTTGT TTCTCCAGAT TTTTAATACG TTAATACGCA GGCATCTGAT 2478

TTGCATATTC ATTCATGGAC CACTGTTTCT TGCTTGTACC TCTGGCTGAC TAAATTTGGG 2538

GACAGATTCA GTCTTGCCTT ACACAAAGGG GATCATAAAG TTAGAATCTA TTTTCTATGT 2598

ACTAGTACTG TGTACTGTAT AGACAGTTTG TAAATGTTAT TTCTGCAAAC AAACACCTCC 2658

TTATTATATA TAATATATAT ATATATATCA GTTTGATCAC ACTATTTTAG AGTC 2712



WHAT IS CLAIMED IS:

1. An isolated DNA fragment, comprising a sequence of
nucleotides that encodes an α1 subunit selected from the group
consisting of α1A-1, α1A-2, α1E-1, α1C-2 and α1E-3.

2. The DNA fragment of claim 1, wherein the α1 subunit
is α1A-1 or α1A-2.

3. The DNA fragment of claim 1, wherein the α1 subunit
is α1E-1 or α1E-3.

4. The DNA fragment of claim 1, wherein the α1 subunit
is α1C-2.

5. An isolated DNA fragment, comprising a sequence of
nucleotides that encodes a β subunit selected from the group
consisting of β2, β3 and β4.

6. The DNA fragment of claim 5, wherein the subunit is
a β2C, β2D or β2E subunit.

7. The DNA fragment of claim 5, wherein the subunit is
a β3 subunit.

8. The DNA fragment of claim 7, wherein the subunit is
a β3-1 subunit.

9. The DNA fragment of claim 5, wherein the subunit is
a β4 subunit.

10. The DNA fragment of claim 9, wherein the subunit has
an amino acid sequence set forth in SEQ ID No. 28.

11. A eukaryotic cell, comprising heterologous DNA that
encodes an α1 subunit selected from the group of subunits
consisting of α1A-1, α1A-2, α1C-2, α1E-1, and α1E-3.

12. A eukaryotic cell, comprising heterologous DNA that
encodes an α1 subunit and heterologous DNA that encodes a β
subunit, wherein at least one subunit is selected from the
group of subunits consisting of α1A-1, α1A-2, α1C-2, α1E-1, α1E-3, β2C,
β2D, β3-1 and a β4 subunit.

13. The eukaryotic cell of claim 12, wherein the β
subunit is a β2 subunit.

14. The eukaryotic cell of claim 12, wherein the β
subunit is a β4 subunit.

15. The eukaryotic cell of claim 11, selected from the
group consisting of HEK 293 cells, Chinese hamster ovary
cells, African green monkey cells, and mouse L cells.

16. The eukaryotic cell of claim 12 selected from the
group consisting of HEK 293 cells, Chinese hamster ovary
cells, African green monkey cells, and mouse L cells.

17. A eukaryotic cell with a functional, heterologous
calcium channel, produced by a process comprising:

introducing into the cell heterologous nucleic acid that
encodes an α1-subunit of a human calcium channel, wherein:

the α1 subunit is selected from the group consisting of
α1A-1, α1A-2, α1C-2, α1E-1 and α1E-3;

the heterologous calcium channel contains at least one
subunit encoded by the heterologous nucleic acid; and

the only heterologous ion channels are calcium channels.

18. A eukaryotic cell with a functional, heterologous
calcium channel, produced by a process comprising:

introducing into the cell nucleic acid that encodes an
α1 subunit of a human calcium channel and introducing into the
cell nucleic acid that encodes a β subunit of a human calcium
channel, wherein:

at least one of the subunits is selected from the group
consisting of α1A-1, α1A-2, α1E-1, α1E-3, β2C, β2D, β2E, a β3 and a β4
subunit;

the heterologous calcium channel contains at least one
subunit encoded by the heterologous nucleic acid; and

the only heterologous ion channels are calcium channels.

19. The eukaryotic cell of claim 17 selected from the
group consisting of HEK 293 cells, Chinese hamster ovary
cells, African green monkey cells, mouse L cells and amphibian
oocytes.

20. The eukaryotic cell of claim 18 selected from the
group consisting of HEK 293 cells, Chinese hamster ovary
cells, African green monkey cells, mouse L cells and amphibian
oocytes.

21. The eukaryotic cell of claim 18, wherein the β
subunit is a β2, β3 or β4 subunit of a human calcium channel.

22. The eukaryotic cell of claim 18, wherein the calcium
channel includes an α2b subunit of a human calcium channel, an
α1B-1 subunit of a human calcium channel and a β3 subunit of a
human calcium channel.

23. The eukaryotic cell of claim 18, wherein the calcium
channel includes an α1B-1, α2b, and a β1-2 subunit, or an α1B-1,
α2b, and a subunit, or an α1B-2, α2b, and a β1-3 subunit, or
an α1A-2, α2b, and a β3-1 subunit, or a α1B-1, α2b, and an β3-1
subunit.

24. The eukaryotic cell of claim 18, wherein the
calcium channel contains an α2b subunit of a human calcium
channel, an α1B or an α1D subunit of a human calcium channel and
a β1-2 or subunit of a human calcium channel.

25. A method for identifying a compound that modulates
the activity of a calcium channel, comprising:

suspending a eukaryotic cell that has a functional,
heterologous calcium channel, in a solution containing the
compound and a calcium channel-selective ion;

depolarizing the cell membrane of the cell; and
detecting the current flowing into the cell,
wherein:

the heterologous calcium channel includes at least one
human calcium channel subunit encoded by DNA or RNA that is
heterologous to the cell;

at least one subunit is selected from the group
consisting of α1A-1, α1A-2, α1E-1, α1E-3, α1C-2, β2C, β2D, β2E, a β3
subunit and a β4 subunit;

the current that is detected is different from that
produced by depolarizing the same or a substantially identical
cell in the presence of the same calcium channel selective ion
but in the absence of the compound.

26. The method of claim 25, wherein the heterologous DNA
or RNA encodes a β2 subunit.

27. The method of claim 26, wherein the heterologous DNA
or RNA encodes a β4 subunit.

28. A subunit-specific antibody selected from the group
consisting of antibodies that bind to an α1 subunit type or a
subunit subtype of a human calcium channel, wherein the
subunit is an α1 subunit.

29. The antibody of claim 28, wherein the antibody is
subtype specific and the α1 subunit is α1A, α1E and α1B.

30. An RNA or single-stranded DNA probe of at least 16
bases in length comprising at least 16 substantially
contiguous bases from nucleic acids that encode a subunit of
a human calcium channel selected from the group of subunits
consisting of α1A-1, α1A-2, α1E-1, α1C-2, α1E-3, β3-1, β2C, β2E and
β4 subunits.

31. The probe of claim 30 that contains at least 30
bases that are from nucleic acids that encode a subunit of a
human calcium channel selected from the group of subunits
consisting of α1A-1, α1A-2, α1E-1, α1C-2, α1E-3, β3-1, β2C, β2D, β2E and
β4 subunits.

32. A method for identifying nucleic acids that encode
a human calcium channel subunit, comprising hybridizing under
conditions of at least low stringency a probe of claim 30 to
a library of nucleic acid fragments, and selecting hybridizing
fragments.

33. A method for identifying cells or tissues that
express a calcium channel subunit-encoding nucleic acid,
comprising hybridizing under conditions of at least low
stringency a probe of claim 30 with mRNA expressed in the
cells or tissues or cDNA produced from the mRNA, and thereby
identifying cells or tissue that express mRNA that encodes the
subunit.

34. A substantially pure human calcium channel subunit
selected from the group consisting of α1A-1, α1A-2, α1E-1, α1C-2,
α1E-3, β3-1, β2C, β2D, β2E and β4.